Professional Documents
Culture Documents
Abstract— In this paper a methodology to identify sleep Currently, sleep apnea is diagnosed using poly(somno)-
apnea events is presented. It uses four easily computable graphy, which is a sleep test evaluated by a clinician, and
features, three generally known ones and a newly proposed performed either in a hospital environment or at home. This
feature. Of the three well known parameters, two are computed
from the RR interval time series and the other one from the test monitors different physiological measurements such as
approximate respiratory signal derived from the ECG using heart rate and respiration, which are significantly affected
principal component analysis (PCA). The fourth feature is during episodes of apnea. Although poly(somno)graphy is
proposed in this paper and it is computed from the principal nowadays the most important tool to diagnose sleep dis-
components of the QRS complexes. Together with a least orders, its elevated costs and reduced comfort cause sleep
squares support vector machines (LS-SVM) classifier using an
RBF kernel, these four features achieve an accuracy on test data apnea to be under-diagnosed. To overcome this, several
larger than 85% for a subject independent classification, and studies like for instance [3], [4], [5] and [6], have considered
of more than 90% for a patient specific approach. These values the possibility of detecting episodes of sleep apnea by using
are comparable with other results in the literature, but have the only the ECG signal. In this way, home monitoring systems,
advantage that their computation is straightforward and much which are cheaper and more comfortable, can be developed
simpler. This can be important when implemented in a home
monitoring system, which typically has limited computational to detect and diagnose this disorder.
resources. In general home monitoring systems have limited calcula-
tion power, and this eliminates the possibility to implement
I. INTRODUCTION computationally expensive algorithms. That is why methods
are needed that call for less processing, but which nonethe-
Sleep apnea is considered an important factor for morbid- less achieve the same grade of accuracy as more demanding
ity and mortality due to its direct effect on the cardiovascular algorithms. In this paper such a method is proposed which
system [1]. These effects are associated with physiological uses a set of four very simple features. Amongst the features
functions such as systemic hypertension and increased sym- there are three generally known ones: namely the standard
pathetic activity that compromise the heart. deviation of the RR intervals, the serial correlation coefficient
Different types of respiratory events were identified in of the RR intervals and the standard deviation of the ECG de-
[2], and they are characterized by alterations in the airflow. rived respiration using Principal Component Analysis (PCA)
These alterations can consist of a reduced airflow as in [10]. In addition, a new feature is proposed here, based on
hypopneas or complete absence of airflow (during at least 10 the percentage of variance explained by the second principal
seconds) as in apneas. The classification of respiratory events component of a matrix formed by the QRS complexes.
as obstructive, mixed or central is based on the respiratory This feature will be explained in more detail in the next
effort. section. These features were used with a least squares support
vector machine (LS-SVM) classifier [7] using an RBF kernel,
*Research supported by Research Council KUL: GOA MaNet, to separate respiratory events from normal activity on two
PFV/10/002 (OPTEC), IDO 08/013 Autism, several PhD/postdoc & fel- different datasets.
low grants;Flemish Government: FWO: PhD/postdoc grants, projects:
G.0427.10N (Integrated EEG-fMRI), G.0108.11 (Compressed Sensing) The remainder of this paper is organized as follows.
G.0869.12N (Tumor imaging) G.0A5513N (Deep brain stimulation); Section II describes the two datasets used in this study and
IWT: TBM070713-Accelero, TBM080658-MRI (EEG-fMRI), TBM110697- the methodology implemented to analyze them. The results
NeoGuard, PhD Grants; iMinds 2012; Flanders Care: Demonstratieproject
Tele-Rehab III (2012-2014); Belgian Federal Science Policy Office: IUAP are presented and discussed in section III and the conclusions
P719/ (DYSCO, ’Dynamical systems, control and optimization’, 2012- in the final section.
2017); ESA AO-PGPF-01, PRODEX (CardioControl) C4000103224; EU:
RECAP 209G within INTERREG IVB NWE programme, EU HIP Trial II. METHODOLOGY
FP7-HEALTH/ 2007-2013 (n 260777), EU ITN Transact 2012; ERC ADG
A-DATADRIVE-B A. Data
C. Varon is with the KU Leuven, Department of Electrical Engineering
ESAT, SCD-SISTA, and iMinds Future Health Department, Leuven, Bel- Two datasets were used in this study. The first one was
gium e-mail: carolina.varon@esat.kuleuven.be the Apnea-ECG database [3] publicly available on Physionet1
D. Testelmans and B. Buyse are with the UZ Leuven, Department of [8]. This dataset consists of 70 single-lead ECG recordings
Pneumology , Leuven, Belgium
S. Van Huffel and J.A.K. Suykens are with the KU Leuven, Department sampled at 100Hz, and 8 respiratory signals available for 8 of
of Electrical Engineering ESAT, SCD-SISTA, and iMinds Future Health
Department, Belgium. 1 www.physionet.org
the 70 recordings. Each ECG signal was manually annotated features was extracted from the EDR and from the RR time
by an expert, who indicated on a minute-by-minute basis series. This set of features consists of variances of 5 levels
whether an apnea episode occurred. These annotations do of wavelet decomposition, 32 points of the power spectrum,
not differentiate between types of respiratory events. In total power in the low frequency band (0.04-0.15Hz) and power
34 313 minutes were extracted and annotated. in the high frequency band (0.15-0.4Hz). The computation
The second dataset consists of 10 single-lead ECG signals of this set of features is described in [4].
extracted from polysomnographic recordings of 10 different This feature set was then expanded with an extra parameter
patients of the sleep laboratory at the University Hospital to reduce the complexity and the number of features needed
Leuven (UZ Leuven), Belgium. Additionally, respiratory to classify respiratory events. This extra parameter is based
signals are available for 5 patients. All the signals were on the computation of the EDR signal using PCA. Usually,
sampled at 200Hz and their duration ranges from 320 to 469 only the end result of this computation is taken into account,
minutes (384±47). In total 3 847 minutes were extracted and however, in this paper it is shown that a variation of the
annotated by a medical doctor experienced on interpreting procedure provides information that can be related to apnea
polysomnographic signals. These annotations correspond to events. This becomes clear when this procedure is analyzed
the beginning and duration of the “respiratory events”, and in detail.
they make a distinction between 5 different types: obstructive In [10] a collection of beat features extracted from the
apnea (OSA), obstructive hypopnea (OSH), hypopnea (HPA), single-lead ECG is aligned in a single matrix to which PCA
mixed (Mix), and central apnea (Cen). is applied. Once the principal components (PCs) of the new
matrix are computed, the first PC is closely related to the
B. ECG processing and RR interval time series respiratory signal. In this work, the selected beat feature is
Initially, each signal was segmented into epochs of one the QRS complex, and the algorithm defines a window of
minute, which were then analyzed for artefacts using the 60ms around each R-peak, removes the mean and organizes
algorithm presented in [9]. If an ECG minute contained an these windows in a QRS matrix X ∈ RN ×M , where N is
artefact, it was removed from the dataset. Then, each re- the number of beats, M = 0.06 × f s, and f s the sampling
maining ECG minute was processed using the Pan-Tompkins frequency (see Fig. 1). The matrix X is transformed into a set
algorithm, which identifies the position of the R-peaks. of principal components by means of the eigendecomposition
Additionally, a search back procedure [5] was applied to of its covariance matrix, and the first component is then
detect and correct ectopic and missing beats. Once the R- identified as the ECG derived respiration. The diagram of
peaks were identified, the RR interval time series were Fig. 1 summarizes the procedure described above.
obtained by taking the time intervals between successive The variation proposed in this paper, applies the algo-
beats. rithm to X T rather than X. The reasoning behind this
adaptation can be explained very intuitively. Applying the
C. ECG derived respiration (EDR) procedure to X and keeping the first component gives an
After identifying the R-peaks, three different methodolo- estimation of the respiratory signal. When considering X T
gies, namely the amplitude of the R-peak, principal compo- rather than X, however, a “local” interpretation around each
nent analysis (PCA) [10] and kernel principal component R-peak is obtained. This local interpretation corresponds
analysis (kPCA) [11], were implemented to compute the to the morphology of the QRS complexes, which on its
ECG derived respiratory (EDR) signals. These three methods turn is modulated by respiration. Because of this relation
are based on the volume changes produced in the lungs with respiration, the local interpretation can be used to
during a respiratory cycle, which on its turn modifies the distinguish between normal and abnormal minutes. This
position of the electrodes with respect to the heart. In is clarified in Fig. 2, where a comparison between the
other words, they exploit the mechanical interaction between eigenvalues of two minutes, one normal and one containing
respiration and the morphology of the heart beats. Previous an apneic event, is displayed together with the percentage of
studies validated these methodologies [10], [11], [12], and variance explained by each component.
P These percentages
they showed that the computed EDRs closely resemble the of variance are defined as (λi / λi ) × 100, where λi is
real respiratory signals. the ith eigenvalue of the covariance matrix, i = 1, . . . , m,
and m the window size. In the figure it is observed that
D. Feature extraction and Feature selection when apnea occurs, the percentage of variance explained by
To characterize each ECG minute, a set of time and the first eigenvector is reduced, while the variance explained
frequency domain parameters were derived from the RR by the second increases. In other words, more components
interval time series and three EDR signals. The time domain are needed to describe the variations in the morphologies of
parameters derived from the RR time series were the mean, the QRS complexes. Taking into account the analysis above,
standard deviation, root mean square of inter-beat differen- this study proposes to use both the percentage of variance
tials, standard deviation of inter-beat differentials, 5 serial explained by the first and second principal components as
correlation coefficients and 4 fractal Alan factors. From the features.
EDR signals, only the mean and standard deviation were After deriving the set of features described above, their
calculated. Concerning the frequency domain, the same set of discrimination level was determined by means of the F1
Beats Window
× size EDR
0.12
Σ = XX T 1
α(1)
X=
λ
0.115
Σα(i) = λi α(i) 0.5
0.11
0
0 1 2 3 4 5 6 7 8 9 10 0 20 40 60
i Beat number
Fig. 1. Computation of the EDR signal using PCA [10]. From left to right: matrix containing the QRS complexes, general procedure to find the
principal components, eigenvalues of the covariance matrix, and EDR signal corresponding to the eigenvector with the highest eigenvalue. Note that the
first eigenvalue is large compared to the others, which indicates that most of the variance of the data is explained by its corresponding eigenvector.
Normal Apnea
55.72%
true negative, true positive, false positive and false negative
1 90.22% 1
events respectively.
0.8 0.8 43.31%
III. RESULTS AND DISCUSSION
0.6 0.6
λ