Professional Documents
Culture Documents
Abstract
1. Introduction
As the life expectancy at birth increases, the survival at older age increases
which results in growing older population. It is estimated that the world el-
derly population will reach 70 to 80 years of life expectancy by 2050 according
∗ Corresponding author
Email addresses: deepak.vijay.14e@iitram.ac.in (Deepak Vijay),
hitesh.suthar.14e@iitram.ac.in (Hitesh Suthar)
2
ECG classes. In this work, the author used decision tree classifier for the anal-
ysis of ECG beats. In addition, Desai et al. [11] used recurrence quantification
method for the analysis of multiclass tachycardia beats using ensemble classifier.
Fahim et al. [12] used data mining approach with expectation-maximization
based clustering on fifty-compressed ECG signals. The author reduced the
number of feature by using correlation-based feature subset method using the
rule-based system.
In the recent work of Acharya en al. [13], they used convolution neural
network (CNN) for the classification of different classes of arrhythmias using
different intervals of ECG segments without any involvement of feature selection.
The CNN used in this work is of 11-layer deep. They obtained the accuracy
of 92.50% for two seconds of duration ECG sets and 94.50% for five seconds
of duration ECG sets. Although, they have achieved significant results but the
dataset required for the training is ample and it takes more time to train the
data.
From [7–13], we observed that most of the authors have used three classes
for the diagnosis of arrhythmia. Our paper focuses on four class (αf ib , αf l ,
νf ib and ηsr ) classification using the biorthogonal wavelet filter bank followed
by feature selection. We have extracted features namely: Norm (N1 ), Energy
(N2 ), Sample Entropy (Es ), Fuzzy Entropy (Ef ) and Renyi Entropy (Er ). We
achieved maximum accuracy of 98.00% for five seconds dataset and 96.30% for
two seconds of dataset which is better than the recent work of Acharya en al.
[13].
Biorthogonal wavelet filter banks are computationally fast, Due to which
they are the major tool for the classification of Electrocardiograph and Elec-
troencephalogram signals. Various techniques are used for the design of the
wavelet filter banks. The work of Sharma et al., [14] used an eigen-filter based
technique for the design of joint duration-bandwidth localized BWFB. This
method is numerically efficient as well as it can be incorporated in duration and
bandwidth domain simultaneously. In addition, a parametrization technique can
be used to design the joint duration-bandwidth localized discrete-time BWFB
as illustrated by Sharma et al. [15]. This techniques is employed where PR
and VMs conditions are needed to impose structurally. It provides the num-
ber of parameters constant regardless of the filter length. Performance of this
technique is found to be efficient in the applications of image coding [15].
In this paper, we are using joint duration-bandwidth localized (JDBWL)
discrete-time biorthogonal wavelet filter bank designed using parametrization
technique for the classification of arrhythmia. The technique used has the ad-
vantages of both the linear-phase and the joint duration bandwidth localization.
The proposed technique uses the optimal criterion taking into consideration the
JDBWL of all the basis sample of the analysis and synthesis FBs. Initially the
parametrized linear-phase PR two-channel filter bases are designed with non
arbitrary vanishing moments (VMs), degrees of freedom and lengths, then the
optimization of independent variables is done to achieve the wavelet FBs that
are optimized in duration and bandwidth.
Following are the salient features of the parametrization technique used:
3
–We have successfully achieved parametric aspects for analysis and synthesis
filter of 13/7 FBs. Due to unrestricted free parameters, parametrized filters
coefficients can be optimized.
–For predefined decomposition levels, the JDBWL of all discrete-time wavelet
bases is taken into the optimization criterion.
–The performance of proposed technique for designing optimized JDBWL FBs
for the detection of arrhythmia is found better than other methods.
2. Dataset Used
The test dataset in our work is obtained from PhysioBank, that includes
following databases: MIT-BIH atrial fibrillation (afdb), MIT-BIH arrhythmia
(mitdb) and Creighton university ventricular tachyarrhythmia (cudb) (Table 1),
which are lead-2 ECG signals only. The used databases are of different dura-
tions: two seconds and five seconds (table ref), these contain 21,709 (Dataset
A) and 8,683 (Dataset B) numbers of ECG segments, respectively (Table 2).
Databank Dataset
MIT-BIH arrhythmia (mitdb) αf ib , αf l , ηsr
MIT-BIH atrial fibrillation (afdb) αf ib , αf l
Creighton university ventricular tachyarrhythmia (cudb) νf ib
4
3. Methodology
Figure 1: Flow chart showing the methodology for automatic determination of different classes
of ECG signals
3.1. Pre-processing
In the database, the ECG signals received from MIT-BIH arrhythmia have
sampling frequency of 360 Hz and the database of ECG signals from MIT-BIH
αf ib and Creighton University ventricular tachyarrhythmia have the sampling
frequency of 250 Hz. Since the sampling frequency of the databases are different,
therefore we have down-sampled the frequency from 360 Hz to 250 Hz.
Normalized Amplitude
4 0
-2
2
-4
0
-6
0 0.5 1 1.5 2 0 0.5 1 1.5 2
Seconds Seconds
Atrial Flutter ECG Ventricular Fibrillation ECG
2 3
Normalized Amplitude
Normalized Amplitude
2
0 1
0
-2 -1
-2
-4 -3
0 0.5 1 1.5 2 0 0.5 1 1.5 2
Seconds Seconds
5
Normal ECG Atrial Fibrillation ECG
8 2
Normalized Amplitude
Normalized Amplitude
6
0
4
-2
2
-4
0
-2 -6
0 1 2 3 4 5 0 1 2 3 4 5
Seconds Seconds
Atrial Flutter ECG Ventricular Fibrillation ECG
6 3
Normalized Amplitude
Normalized Amplitude
4 2
1
2
0
0
-1
-2 -2
-4 -3
0 1 2 3 4 5 0 1 2 3 4 5
Seconds Seconds
6
where l=(2n + 1), n = 0, 1...N (integer)
As a result the perfect reconstruction condition is expressed as [16]
Where n−1
X n + m + 1
−1 n
S2n−2 (z) = (2 − z − z ) (4)
n
m=0
7
2. Duration-bandwidth localization and discrete wavelet bases
There is an uncertainty in the analysis of signal in time and frequency-
domains simultaneously as a signal cannot be arbitrarily localized in time
and frequency at the same time. In the work of Ishii and Furukawa [17],
they have given variance based duration bandwidth localization measure
for sequences in l2 (Z). Let b(m) is defined as a sequence of discrete time
real-valued in l2 (Z) with |b|22 = 1 and discrete-time fourier transform of
2
b(m) is B(ω). The time-variance ςm and the mean m0 of the sequence is
expressed as:
X
m0 = m|b(m)|2 (11)
m∈Z
X
2
ςm = (m − m0 )2 |b(m)|2 (12)
m∈Z
Let us denote the product of time and frequency variance of the sequence
b(m) by Ψb . The lower bound of the TFP is decided by the uncertainty
principle which can be expressed as:
2 2 (1 − |B(π)|)2
Ψb = ς m ςω ≥ (14)
4
Frequency mean ω0 and frequency variance ςω2 of a band-pass sequence
b(n) are as follows [18]:
ˆ ˆ
1 π 2 2 1 π
ω0 = ω|B(ω)| dω, ςω = (ω − ω0 )2 |B(ω)|2 dω (15)
π 0 π 0
From the equation (5) and (8), the lower bound can be given to the TFP
for a band-pass sequence using the inequality (7) as [18]:
2 2 (1 − η)2
Ψb = ςm ςω ≥ (16)
4
ω0 2 ω0
where η = π |B(0)| + 1− π |B(π)|2
8
Hence using equation (10) and inequality (7) TFP for a high-pass sequence
is obtained as [18]:
2 2 (1 − |B(0)|)2
Ψb = ς m ςω ≥ (18)
4
Interestingly, similar to the continuous time function, for low-pass, high-
pass and band-pass sequence in l2 Z, the time-frequency product is also
lower bounded. It is due to null of the spectrum of the sequence at ω = π
and ω = 0. The synthesis and analysis high-pass filter should have one
zero at ω = 0. The generated iterated filters should have at least one zero
at ω = π and other at ω = 0 for band-pass discrete wavelet sequence.
Hence, the TFP of all the sequences and filters is bounded by 0.25 that is:
Y ( z)
Yˆ ( z )
2 2 1
Ψb = ς m ςω ≥ (19)
4
For two channel biorthogonal filter bank, we have chosen the i number
of iterations to decompose a sequence into i + 1 sub-bands. For i = 4
iterations, Figure 4 illustrates the structure of ALF and SLF. The tree
type structure of iterated FB can be reconstructed as an equivalent parallel
structured filter bank (Figure 5) having i + 1 parallel branches (filters)
using the noble identities of [19]. In the Figure 4 the lower-most branch
9
is high-pass filter, upper-most branch is low-pass filter and other branches
represent the band-pass filters. We minimize TFP for all the filters (low-
pass, high-pass and band-pass filter). Expression for the i + 1 analysis-
filters for parallel structured FB
B11 (z) = B1 (z) (20)
j−2
j−1 Y k
B1j (z) = B1 (z 2 ) B0 (z 2 ), j = 2, 3, ..., i (21)
k=0
i−1
Y k
B0i (z) = B0 (z 2 ) (22)
k=0
and expression for i + 1 synthesis-filters is as follows:
G11 (z) = G1 (z) (23)
j−2
j−1 Y k
Gj1 (z) = G1 (z 2 ) G0 (z 2 ), j = 2, 3, ..., i (24)
k=0
i−1
Y k
Gi0 (z) = G0 (z 2 ) (25)
k=0
B⁴₀(z) 16 16 G⁴₀(z)
B⁴₁(z) 16 16 G⁴₁(z)
Y ( z) B³₁(z) 8 8 G³₁(z) Yˆ ( z )
B²₁(z) 4 4 G²₁(z)
B¹₁(z) 2 2 G¹₁(z)
Let, in time-domain, bj1 (m), j = 1, ..., i and b10 represents the parallel-filters
of parallel structured AFB which can be regarded as wavelet-vectors/sequences.
These vectors are known as discrete time analysis wavelet-basis and it is
denoted as ∆. Similarly, we define parallel filters of parallel structured
SFB by g1j (m), j = 1, ..., i and g01 . The vectors are known as discrete time
synthesis wavelet-basis (WB), represented by ∆. ˜ Therefore, the uncer-
tainty principle can be defined for discrete time analysis and synthesis
bases. For µ = {b1 , b2 , b3 , ...bM } this basis, that contains M discrete-
wavelet-vectors, the expression of TFP is defined as:
M
X
Ψµ = w k Ψb k (26)
k=1
10
PM
wk represents the weights such that 0 ≤ wk ≤ 1 and k=1 wk = 1. Ψµ is
weighted-average of time-frequency products of the wavelet vectors. The
uncertainty relation for µ is:
M
1 X
Ψµ = wk Ψbk ≥ 0.25 (27)
M
k=1
Therefore, the TFP for discrete time is same as continuous time analysis
and synthesis WBs that is bounded by 0.25.
3. Optimization method
To obtain the JDBWL, the objective function is minimized.
P = rΨ∆ + (1 − r)Ψ∆
˜ , r ∈ [0, 1] (28)
The optimization process for all the steps are explained as following:
i. The numbers of free-parameters, VMs and lengths of the filter
pair as well as decomposition levels are fixed .
ii. The expressions for G0 (z) and B0 (z) are obtained by using parametriza-
tion technique.
iii. The region where the filter banks are regular, find the value of
independent parameter. The cascade-algorithm converges in this
region.
iv. Using the range mentioned above, the initial value for free-parameters
are selected.
v. Optimization of the objective function P is performed using
f mincon solver optimization toolbox of MATLAB.
vi. Inspect the value of function P whether it is nearer to the lower-
bound 0.25, otherwise go back to the step iv and repeat the
process until lower bound is close to 0.25.
11
Index f0 h0
0 0.586549684753843 0.575285420160546
1 0.260676013170069 0.300071355040137
2 -0.0683046054662545 -0.0376427100802732
3 -0.00979969727517512 -0.0500713550401366
4 0.0238641104514918 –
5 -0.000876315894894261 –
6 0.00116565263784130 –
1
f0 : Analysis Filter Coefficients; h0 : Synthesis Filter Coefficents
Table 3: Filter coefficients of the Analysis and Synthesis Filter Bank.
4. Features used
4.1. Norm (N )
The Lp norm of y (β) is defined as:
M 1/p
1 X (β) p
(β)
||Nβp || Nβp 1/p
||y ||p = = = |y | (31)
M i=1
where p≥1 has the same dimensions as the corresponding signals y (β) . The
norm adds two trends: β decrease with the power 1/p and a strong increase
caused by the power p. Thus the norm can be generalised to represent the
norms from the least to the maximum, which regard to the orders p = −∞ and
p = ∞, respectively. This includes the absolute mean (p = 1) and the rms value
(p = 2) as special cases. The norm can be achieved as the norm for the norms
of individual samples.
12
4.2. Sample Entropy (Es )
Real-valued discrete time signal is defined as x = (x1 , x2 , ..., xN ) of length
N . At each state of x, a vector including the n-th value is defined as Xtn =
{xt , xt+1 , ..., xt+n−2 , xt+n−1 } where t = 1, 2, ..., N − (n − 1) and n is the embed-
ding dimension that represents number of samples in each vector. The absolute
value of the difference between Vt1 and Vt2 is the distance L(Vt1 , Vt2 ) between
these vectors. If the distance L(Vt1 , Vt2 ) is lessor than predefined tolerance s, the
probability P m (s) represents the n-dimensional matched vectors [22]. Therefore
P m+1 (s)
the sample entropy is defined as: Es (x, n, s) = − ln P m (s)
5. Results
The confusion matrix for the five seconds dataset and two seconds dataset are
shown in the Table 4 and Table 5. From the Table 4, it is seen that the 99.31%
ECG-segments of αf ib for five seconds has been classified correctly. Whereas,
more than 12% of the ECG-segments of αf l are wrongly classified as αf ib and
ηsr . Similarly, more than 12% of νf ib signals are wrongly classified as αf ib .
From the Table 5, it is observed that 97.98% ECG-segments of αf ib has
been classified correctly. However, more than 18% of νf ib signals are wrongly
classified as αf ib , αf l and ηsr .
13
From Tables (4, 5), the overall classification for dataset A and B is done
with the accuracy, sensitivity and PPV of 98.00%, 91.20% & 94.78% for dataset
A and 96.30%, 88.61% & 89.48% for dataset B, respectively.
Figure 6 and 7 show the ROC curve for the classification. The Area under
curve (AUC) obtained is 0.99 and 0.98 for five seconds dataset and two seconds
dataset, respectively.
14
1
0.8
AUC = 0.99
0.4
0.2
0.8
True positive rate
0.6
AUC = 0.98
0.4
0.2
15
6. Discussion
From results, the first thing observed is that the performance of five seconds
and two seconds databases are comparable. The five seconds data performed
slightly well than the two seconds data as the five seconds database contain
more duration i.e. three additional seconds.
The data samples of νf ib taken in this study is very less (161 and 65 ECG-
segments) as compared to the other data samples. Therefore, it is the main
reason for achieving significantly less sensitivity and PPV.
The accuracy obtained is comparable to the various studies of Table 6. We
achieved accuracies of 93.50% and 89.60% for dataset A and B, respectively by
using total 86827 number of ECG-segments (902 ηsr , 18804 αf ib , 1840 αf l and
162 νf ib ) of five seconds database and total 8683 number of ECG-segments (361
ηsr , 7521 αf ib , 736 αf l and 65 νf ib ) of two seconds database. We used norm
(N1 ), energy (N2 ), reiny entropy (Er ), fuzzy entropy (Ef ) and sample entropy
(Es ) as a discriminating features. Using 13/7 biorthogonal wavelet filter bank,
the performance of our method is robust.
16
Table 6: Performance comparison of our work with previous works in the field of detection of
arrhythmia
17
SB 1 SB 2 SB 3
150
35
20
30
15 25 100
20
10
15
50
10
5
5
0 0 0
0 1 2 3 0 1 2 3 0 1 2 3
SB 4 SB 5
200
80
150
60
40 100
20
50
0
0 1 2 3 0 1 2 3
(a) Norm
SB 1 SB 2 SB 3
60
2.5 500
50
2 400
40
1.5 300
30
1 200
20
0.5 10 100
0 0 0
0 1 2 3 0 1 2 3 0 1 2 3
SB 4 SB 5
300
3000
250
2500
200 2000
150 1500
100 1000
50 500
0 0
0 1 2 3 0 1 2 3
(b) Energy
18
SB 1 SB 2 SB 3
1.4
0.12 0.3
1.2
0.25
0.1
1
0.2
0.08
0.8
0.15
0.06
0.6
0.1
0.04 0.4
0.05
0.02 0.2
0
0 0
-0.05
0 1 2 3 0 1 2 3 0 1 2 3
SB 4 SB 5
1.4
1.5
1.2
1
1
0.8
0.6
0.5
0.4
0.2
0
0 1 2 3 0 1 2 3
2 -1
4 -2
0
-3
2
-4
-2
-5
0
-4 -6
0 1 2 3 0 1 2 3 0 1 2 3
SB 4 SB 5
1 -4
0
-1 -5
-2
-6
-3
-4 -7
-5
-8
-6
0 1 2 3 0 1 2 3
19
SB 1 SB 2 SB 3
2.5
2 2
1.5
1.5 1.5
1
1
1
0.5
0.5
0.5
0
0 0
-0.5
0 1 2 3 0 1 2 3 0 1 2 3
SB 4 SB 5
2.5 2
2
1.5
1.5
1 1
0.5
0.5
0
-0.5 0
0 1 2 3 0 1 2 3
Figure 6: Box plots for normal vs different types of arrhythmia for two seconds
20
SB 1 SB 2 SB 3
350
50 350
300
300
40
250
250
30 200 200
150 150
20
100 100
10
50 50
0 0 0
0 1 2 3 0 1 2 3 0 1 2 3
SB 4 SB 5
350
250
300
200
250
150
200
100
150
50
100
0 50
0 1 2 3 0 1 2 3
(f) Norm
SB 1 SB 2 SB 3
5 500 800
4 400
600
3 300
400
2 200
200
1 100
0 0 0
0 1 2 3 0 1 2 3 0 1 2 3
SB 4 SB 5
800 4000
600 3000
400 2000
200 1000
0 0
0 1 2 3 0 1 2 3
(g) Energy
21
SB 1 SB 2 SB 3
0.4
1.2
0.1
1
0.3
0.08
0.8
0.06
0.2
0.6
0.04
0.4
0.1
0.02
0.2
0 0 0
0 1 2 3 0 1 2 3 0 1 2 3
SB 4 SB 5
1.4
1.2 1.5
1
0.8
1
0.6
0.4
0.5
0.2
0 1 2 3 0 1 2 3
1.5 2.5
1.5
2
1
1 1.5
0.5 1
0.5
0.5
0 0 0
0 1 2 3 0 1 2 3 0 1 2 3
SB 4 SB 5
2.5 2.5
2 2
1.5 1.5
1 1
0.5 0.5
0 0
0 1 2 3 0 1 2 3
22
SB 1 SB 2 SB 3
5 -1
2
4
-2
3 0
-3
2
-2 -4
1
-5
0 -4
-1 -6
-6
-2 -7
0 1 2 3 0 1 2 3 0 1 2 3
SB 4 SB 5
0
-4.5
-1
-5
-2 -5.5
-3 -6
-6.5
-4
-7
-5
-7.5
-6
-8
-7
0 1 2 3 0 1 2 3
Figure 5: Box plots for normal vs different types of arrhythmia for five seconds
7. Conclusion
References
[1] U. Nations, World population ageing 2015, New York: Department of Eco-
nomic and Social Affairs PD.
23
[2] P. Anversa, T. Palackal, E. H. Sonnenblick, G. Olivetti, L. G. Meggs, J. M.
Capasso, Myocyte cell loss and myocyte cellular hyperplasia in the hyper-
trophied aging rat heart., Circulation research 67 (4) (1990) 871–885.
[3] R. Acharya, S. M. Krishnan, J. A. Spaan, J. S. Suri, Advances in cardiac
signal processing, Springer, 2007.
[4] K. Najarian, R. Splinter, Biomedical signal and image processing, CRC
press, 2012.
[5] A. L. Goldberger, Clinical Electrocardiography E-Book: A Simplified Ap-
proach, Elsevier Health Sciences, 2012.
[6] R. J. Martis, U. R. Acharya, H. Adeli, Current methods in electrocar-
diogram characterization, Computers in biology and medicine 48 (2014)
133–149.
[7] R. J. Martis, U. R. Acharya, H. Prasad, C. K. Chua, C. M. Lim, J. S. Suri,
Application of higher order statistics for atrial arrhythmia classification,
Biomedical Signal Processing and Control 8 (6) (2013) 888–900.
[8] R. J. Martis, U. R. Acharya, H. Adeli, H. Prasad, J. H. Tan, K. C. Chua,
C. L. Too, S. W. J. Yeo, L. Tong, Computer aided diagnosis of atrial
arrhythmia using dimensionality reduction methods on transform domain
representation, Biomedical Signal Processing and Control 13 (2014) 295–
305.
[9] Y. Wang, Y.-S. Zhu, N. V. Thakor, Y.-H. Xu, A short-time multifractal
approach for arrhythmia detection based on fuzzy neural network, IEEE
Transactions on Biomedical Engineering 48 (9) (2001) 989–995.
[10] U. R. Acharya, H. Fujita, M. Adam, O. S. Lih, T. J. Hong, V. K. Sudar-
shan, J. E. Koh, Automated characterization of arrhythmias using nonlin-
ear features from tachycardia ecg beats, in: Systems, Man, and Cybernetics
(SMC), 2016 IEEE International Conference on, IEEE, 2016, pp. 000533–
000538.
[11] U. Desai, R. J. Martis, U. R. Acharya, C. G. Nayak, G. Seshikala,
R. SHETTY K, Diagnosis of multiclass tachycardia beats using recurrence
quantification analysis and ensemble classifiers, Journal of Mechanics in
Medicine and Biology 16 (01) (2016) 1640005.
[12] F. Sufi, I. Khalil, Diagnosis of cardiovascular abnormalities from com-
pressed ecg: a data mining-based approach, IEEE Transactions on Infor-
mation Technology in Biomedicine 15 (1) (2011) 33–39.
[13] U. R. Acharya, H. Fujita, O. S. Lih, Y. Hagiwara, J. H. Tan, M. Adam,
Automated detection of arrhythmias using different intervals of tachycardia
ecg segments with convolutional neural network, Information sciences 405
(2017) 81–90.
24
[14] M. Sharma, V. M. Gadre, S. Porwal, An eigenfilter-based approach to the
design of time-frequency localization optimized two-channel linear phase
biorthogonal filter banks, Circuits, Systems, and Signal Processing 34 (3)
(2015) 931–959.
[15] M. Sharma, P. Achuth, R. B. Pachori, V. M. Gadre, A parametrization
technique to design joint time–frequency optimized discrete-time biorthog-
onal wavelet bases, Signal Processing 135 (2017) 107–120.
[16] G. Strang, T. Nguyen, Wavelets and filter banks, SIAM, 1996.
[17] R. Ishii, K. Furukawa, The uncertainty principle in discrete signals, IEEE
Transactions on Circuits and Systems 33 (10) (1986) 1032–1034.
[18] M. Sharma, R. Kolte, P. Patwardhan, V. Gadre, Time-frequency localiza-
tion optimized biorthogonal wavelets, in: Signal Processing and Commu-
nications (SPCOM), 2010 International Conference on, IEEE, 2010, pp.
1–5.
25