Professional Documents
Culture Documents
Abstract-This paper presents a new approach to the feature II. ECG DATA AND PREPROCESS
extraction for reliable heart rhythm recognition. This system
of classffication is comprised of three components including All ECG data were obtained from MIT-BIH arrhythmia
data preprocessing, feature extraction and classification of ECG
signals. Two different feature extraction methods are applied database that contains records of many patients with heart
together to obtain the feature vector of ECG data. The wavelet troubles or abnormalities. The frequency of the ECG data
transform is used to extract the coefficients of the transform as was 360HZ. Each record has its respective annotation file
the features of each ECG segment. Simultaneously, autoregressive that indicate the class of the heartbeat. A single channel
modelling(AR) is also applied to obtain the temporal structures ECG is collected and used to algorithm evaluation. Since
of ECG waveforms. Then the support vector machine(SVM) with
Gaussian kernel is used to classify different ECG heart rhythm. there are few categories of abnormal QRS complexes in
Computer simulations are provided to verify the performance of one record, we select different abnormal QRS complexes
the proposed method. From computer simulations, the overall from several records. Six types of QRS complexes appeared
accuracy of classffication for recognition of 6 heart rhythm types frequently in the database. Therefore, we mainly deal with
reaches 99.68%. six types heartbeats which include normal beat(NORMAL),
left bundle branch block beat(LBBB), right bundle branch
block beat(RBBB), paced beat(PACE), premature ventricular
I. INTRODUCTION
contraction(PVC) and atrial premature contraction(APC).
In the data preprocessing process, continuous ECG signals
The electrocardiogram (ECG) is routinely used in clinical must be separated into many segments which contain one
practice, which describes the electrical activity of the heart. In heartbeat. The extracted data of ECG complexes is centered
physical checkups at hospitals, physicians record the ECG af- around R peak. Considered that some PVC duration is great
ter the patient has exercised to check his/her cardiac condition. and sometimes R peak detection may be not the center of
The Holter ECG device is used most frequently for recording the complex, we have selected segment of 250ms before the
the ECG. Physicians apply the device to a patient when they fiducial point and 400ms after that with the R peak point is
need to monitor his/her ECG to find the few abnormal cycles the 90th point. The R peak is detected using the Pan and
in the ECG throughout the day. Physicians then interpret Tompkins algorithm[6]. Thus, each segment must contain one
the shapes of those waves and complexes. They calculate ECG heartbeat. Fig. 1 shows typical waveforms of six types of
parameters to determine whether the ECG shows signs of ECG segments.
cardiac disease or not. The parameters are the height and the
interval of each wave, such as RR interval, PP interval, QT
interval, and ST segment. Recognition of the fiducial points III. FEATURE EXTRACTION
and calculations of the parameters is a tedious routine for the
physician. Therefore, there is an urgent need for an automatic The recognition of heart rhythms requires generation of the
ECG recognition system to reduce the burden of interpreting feature vector which represents the original ECG segment. A
the ECG. good recognition system should depend on the features rep-
Various studies have been done for classification of various resenting the ECG signals in such a way, that the differences
cardiac arrhythmias [1][2][3][4]. In this paper, we propose among the ECG waveforms are suppressed for the waveforms
the combination of wavelet transform and AR model as the of the same type but are emphasized for the waveforms of
feature extraction method, then use the SVM to classify the belonging to different types of heartbeats. We perform the
ECG heartbeat. The proposed approach is validated in the recognition process of heart rhythms on the single heartbeat
MIT-BIH Arrhythmia Database[5] and get high accuracy of of the ECG, proposing the description or representation by
classification. wavelet transform and AR model.
0-7803-9422-4/05/$20.00 ©2005 IEEE
1089
Authorized licensed use limited to: VIT University. Downloaded on September 17,2020 at 03:28:19 UTC from IEEE Xplore. Restrictions apply.
Normal ECG beats of record 100
1.5
.D
0
0.5
c
.0
ito (D
O
0w 0
CD
LU
--_ _
"
500 G 100 150 200 250
-0.5
.1 L-
0 50 1 00 1 50 200 250 300
Time(sample point)
100 150 200 250
Time(sample point) (a)
Wavelet feature extraction of normal beats
3
Fig. 1. Typical ECG heartbeat waveforms of six types
2.!
1.!.5
A. Wavelet Transform _= _
.5 _-_
1090
Authorized licensed use limited to: VIT University. Downloaded on September 17,2020 at 03:28:19 UTC from IEEE Xplore. Restrictions apply.
- Normal
LBBB
RBBB
10, PACE
!n Q ~~~~~~~PVC
+ APC
8 99.8"l
-20 + +
-30,
<0 99.75 -
99.7-
-40
30 99.65-
20 40
10 20 99.6 -
0
Second Component -10 -20 First Component 6.0e-005 24e-005 97e-005 390e-005 0.015 0.0625
Gamma
Fig. 3. Data Distribution in Feature Space Fig. 4. Crossing-validation result by different parameters
n
model order selection in this work was ARfit method[8].
Eventually, we select 4-order AR model to represent each Subjectto: 0 < ci < C(i =1,n), ai = 0 (6)
heartbeat of ECG signals.
Where a > 0 are Lagrange multipliers. When the optimization
C. Generation of Feature Vectors problem has solved, many ai will be equal to 0, and the others
The feature vector was composed of wavelet coefficients will be Support Vectors. C is positive constant which chosen
and AR coefficients. For each ECG heartbeat, the origi- empirically by the user. This parameter expresses degree of
nal signal is a vector of 300 dimensions {Y1,Y2, ..., Y300}. loosing constraint. A large C can classify training examples
The wavelet coefficient for it is a vector of 32 dimensions more correctly. K(X, X') is the kernel function which is
{Wl, w2, ..., w32 }, and the AR coefficient for it is a vector of 4 inner-product defined by K(X, X') = q(X) q-(X'). Then
dimensions {a,, a2, a3,a4}. Then wavelet coefficient and AR the SVM decision function is
coefficient can be concatenated together to form the feature
vector {wI, w2, ..., w32,al,a2, a3, a4} for classification. f(X) = E aiyiK(Xi, X) + b. (7)
Every feature vector represents one point in the feature XiESV
space. Those points of same class should be closer and points A common kernel is the Gaussian radial basis function (RBF),
of different classes should be far from each other. In order
to view the data distribution of all types points of the ECG K(x, x ) = e- IX-X 112/2o2 (8)
heartbeat in figure, PCA was used to reduce the dimension
of each heartbeat's feature vector to three dimensions. Fig.3 In this work, the Gaussian kemel is used as the kernel function.
shows the data distribution of six classes ECG heartbeats in Classification consists of two steps: learning and testing.
three dimensions feature space. From the figure, we can see Our classifier is a learning machine of the supervised type
most points of different types heartbeats have be separated, and Multicategory SVM(MC-SVM). Firstly, all ECG segments
which proved that the process of feature extraction got the that contain a special type heartbeat are mapped into feature
good effect. space using wavelet and AR model which have been explained
above. In the learning phase, SVM receives some patterns as
IV. ECG CLASSIFICATION input. These patterns are heartbeats represented by m feature
Support Vector Machine(SVM) is one of the pattern recog- parameters that can be seen as points in rn-dimensional space.
nition methods, and is proposed by V.Vapnik and his co- Then the machine becomes able to find the labels of new
workers[9][10]. SVM separates an input example X = vectors by comparing them with those used in the learning
(xl, ... Xd) of dimension d into two classes. A decision function phase.
of SVM separates two classes by f(X) > 0 or f(X) < 0. The
size of training set N is (yi, Xi), i = 1, ..., N. Where Xi C RI V. RESULTS & DIsCUSSION
is the input pattern for the ith example, and yi C -1, 1 is In the numerical experiments, we have used the ECG data
the class label. Support Vector classifiers implicitly map Xi from the MIT-BIH Arrhythmia Database corresponding to
form input space to a higher dimensional feature space which the normal heartbeat and 5 types of arrhythmias. Each type
depend on a nonlinear function q(X). A separating hyperplane heartbeat was extracted from the record which contained most
is optimized by maximization of the margin. Then SVM is beats of this type. In this work, the NORMAL beats, LBBB,
solved as the following quadratic programming problem, RBBB, PACE, PVC and APC were extracted respectively from
n 1n the record 100, 109, 118, 107, 208 and 232. Then each type
Maximize: ai -- E ai ajyiyjK(XiI Xj), (5) heartbeat has a data set that contains many heartbeats. Due
to the scarcity of data corresponding to some beat types the
i=l i,j=l
1091
Authorized licensed use limited to: VIT University. Downloaded on September 17,2020 at 03:28:19 UTC from IEEE Xplore. Restrictions apply.
TABLE I
THE RESULT OF CLASSIFICATION
can be done quickly. The experiment results show that using
wavelet and AR model together can result in high accuracy of
classification.
1092
Authorized licensed use limited to: VIT University. Downloaded on September 17,2020 at 03:28:19 UTC from IEEE Xplore. Restrictions apply.