Professional Documents
Culture Documents
Abstract
1. Introduction
Sleep is a behavioral state that is a n atural part of every
individuals life. Human spend about one-third of their
lives asleep. Sleep is not just something to fill time when a
person is inactive. Sleep is important for normal motor and
cognitive function. Sleep actually appears to be required
for survival [1]. Sleepiness problem may be associated
with concentration difficulty, memory lapses, loss of
energy, fatigue, lethargy, and emotional instability. The
prevalence of problem sleepiness is high and has serious
consequences, such as drowsy driving or workplace
115
IJCSI International Journal of Computer Science Issues, Vol. 8, Issue 4, No 2, July 2011
ISSN (Online): 1694-0814
www.IJCSI.org
2. Methodology
The schematic diagram of the system, which is used in this
study is shown in Fig. 1. There are five main stages on this
system, preprocessing, feature extraction, dimensionality
reduction, classification, and performance evaluation.
Detail explanation of each stage will be covered in next
part of this paper.
2.1 Subjects
Data
used
in the
experiment is MIT-BIH
Polysomnographic Database. The database contains over
80 hours' worth of four-, six-, and seven-channel
polysomnographic recordings from 16 subjects, each with
an ECG signal annotated beat-by-beat, and EEG and
respiration signals annotated with respect to sleep stages
and apnea for every 30s epoch [7]. In this database, all 16
subjects were male, aged 32 to 56 (mean age 43), with
weights ranging from 89 to 152 kg (mean weight 119 kg).
Records slp01a and slp01b are segments of one subject's
polysomnogram, separated by a gap of about one hour;
records slp02a and slp02b are segments of another
subject's polysomnogram, separated by a t en-minute gap.
The remaining 14 records are all from different subjects.
The reference annotations were made by human experts on
the basis of simultaneously recorded signals. All
recordings include an ECG signal, an invasive blood
pressure signal, an EEG signal, and a respiration signal.
The six- and seven-channel recordings also include a
respiratory effort signal derived by inductance
plethysmography; some include an EOG signal and an
EMG signal and the remainder includes a cardiac stroke
volume signal and an earlobe oximeter signal. The sleep
stage distributions over the total 10087 epochs were:
W=2951, NREM 1=1750, NREM 2=3939, NREM 3=499,
NREM4= 222, REM= 726. Epoch with MT (movement
time) annotation was not included in this study.
116
IJCSI International Journal of Computer Science Issues, Vol. 8, Issue 4, No 2, July 2011
ISSN (Online): 1694-0814
www.IJCSI.org
2.2 Preprocessing
All the features used in this study are based on QRS
detection times. A QRS detection time is the time of
occurrence of the QRS complex in an ECG signal. In this
study, QRS detection times were generated automatically
for all recordings using Engelse and Zeelenberg algorithm
[8]. This algorithm provides detection times that occur
close to the onset of the QRS complex. RR-intervals
defined as the interval from the peak of one QRS complex
to the peak of the next. RR-interval sequences generated
from QRS detection times could contained physiologically
unreasonable times because signal quality limitation which
leads to errors in the automatically generated QRS
detections. In this study only RR intervals between 0.5 and
1.5 were processed on the stage of feature extraction [5].
SDANN
SDNNIDX
rMSSD*
pNN50
ULF
VLF*
LF*
HF*
LF/HF*
SDNN
117
IJCSI International Journal of Computer Science Issues, Vol. 8, Issue 4, No 2, July 2011
ISSN (Online): 1694-0814
www.IJCSI.org
118
(5)
Centralized Gram matrix is then used for estimation of the
operator:
(6)
Fig. 2 Illustration of dimensionality reduction for
regression/classification.
(7)
The size of Y Y | U in the ordered set of positive definite
matrices can be evaluated by its determinant. Using the
Schur decomposition,
(8)
we have
(3)
(9)
(4)
Fukumizu et al. uses the Gaussian kernel k(x1, x2) = exp(
x1 x2 2/22) [13]. They also show that for
probability-determining kernel spaces, the effective
subspace S can be characterized in terms of the solution to
the following minimization problem:
(10)
(11)
This minimization problem referred with respect to the
choice of subspace S or matrix B as Kernel Dimensionality
Reduction (KDR).
To determine the effective dimensions, in this study the
original feature vector dimension was reduced to 2, 3, and
4 dimension using KDR. The results of classification using
new reduced feature vector then compared with the result
of classification using original feature vector.
IJCSI International Journal of Computer Science Issues, Vol. 8, Issue 4, No 2, July 2011
ISSN (Online): 1694-0814
www.IJCSI.org
2.5 Classification
We applied three classification methods, which were kNN
as the baseline method, random forest and SVM. The last
two methods were selected because they have different
performance on imbalanced dataset. As we know, sleep
stage data has characteristics of imbalanced dataset due to
imbalanced distribution of each sleep stage. Table 3 shows
sleep stage distribution for each record. From table 3, we
can see that most of records dominated by sleep stage
NREM 2.
To train the classifier, 66 % of the epochs were randomly
selected and the remaining epochs presented to the testing
phase. The training data was chosen in such a way that the
distribution of sleep stages on each record retained. Cross
validation with 10 fold was then applied to assess how the
predictive model will generalize to an independent data set.
We performed 10 repetitions in the testing phase to ensure
that the results of classification describe the actual
condition.
Table 3: Sleep distribution for each record
Sleep Stage Distribution (%)
Records
slp01a
2.9
0.4
43.9
19.7
27.6
slp01b
50.6
7.6
34.7
7.1
slp02a
12.6
5.2
58.0
1.4
0.6
22.1
119
5.4
slp02b
39.6
5.3
44.2
10.9
slp03
18.1
15.2
44.8
11.3
10.7
slp04
22.2
8.2
61.7
4.6
3.2
slp14
45.1
26.2
17.7
4.2
1.7
5.1
slp16
45.4
15.6
26.2
3.2
0.3
9.4
slp32
61.3
4.2
25.0
6.8
2.7
slp37
10.8
3.0
84.6
1.6
slp41
29.0
29.6
28.1
1.7
11.6
slp45
0.9
0.9
62.3
8.9
12.6
14.4
slp48
27.9
31.8
35.9
0.3
4.1
slp59
30.7
23.0
21.1
11.0
6.6
7.7
slp60
40.7
47.8
7.1
4.5
slp61
17.3
12.3
45.4
14.3
10.7
slp66
39.4
32.8
26.6
1.1
slp67x
46.7
26.0
26.7
0.7
Overall
30.07
16.39
38.56
5.37
1.32
6.07
(12)
where Pr(a) is the relative observed agreement among
raters, and Pr(e) is the hypothetical probability of chance
agreement, using the observed data to calculate the
probabilities of each observer randomly saying each
category. If the raters are in complete agreement then = 1.
If there is no agreement among the raters then 1> 0.
Table 4 shows kappa coefficient interpretation [13].
Table 4: Kappa coefficient interpretation
Kappa
Agreement
<0
0.0 0.20
IJCSI International Journal of Computer Science Issues, Vol. 8, Issue 4, No 2, July 2011
ISSN (Online): 1694-0814
www.IJCSI.org
Independent
None
9
Specific
Independent
2
Specific
Independent
KDR
3
Specific
Independent
4
Specific
6
4
6
4
6
4
6
4
6
4
6
4
6
4
6
4
40.1
(4.26)
53.42
(4.14)
57.9
(6.06)
68.17
(5.86)
37.34
(4.18)
51.66
(3.91)
59.24
(6.64)
68.4
(6.27)
39.59
(4.78)
53.88
(4.63)
60.34
(5.79)
70.93
(5.64)
40.72
(4.77)
53.79
(4.28)
59.57
(6.4)
69.75
(5.6)
46.55
(4.39)
60.5
(4)
64.35
(5.22)
74.33
(4.86)
39.52
(4.11)
55.16
(4.33)
60.96
(6.38)
70.1
(6.19)
42.41
(4.47)
56.92
(4.23)
60.86
(5.88)
71.75
(5.58)
44.06
(4.32)
58.76
(3.98)
61.06
(5.97)
71.1
(5.95)
SVM
44.13
(3.35)
57.07
(2.22)
57.32
(3.91)
66.37
(4.43)
46.11
(3.61)
61.21
(3.35)
64.26
(5.27)
72.83
(5.1)
44.22
(3.78)
59.14
(3.13)
59.55
(4.42)
69.79
(5.34)
44.4
(3.8)
59.06
(2.94)
56.25
(3.49)
66.48
(4.14)
120
Independent
None
9
Specific
Independent
2
Specific
Independent
KDR
3
Specific
Independent
4
Specific
6
4
6
4
6
4
6
4
6
4
6
4
6
4
6
4
39.53
(2.22)
52.59
(2.06)
57.84
(2.74)
68.31
(2.57)
37.37
(1.92)
51.96
(2.11)
58.95
(2.93)
68.05
(2.83)
39.18
(2.15)
53.01
(2.03)
59.59
(3.03)
69.87
(2.67)
39.88
(2.24)
53.12
(2.02)
58.58
(2.97)
69.22
(2.63)
45.54
(2.34)
60.31
(1.99)
64.08
(3.13)
73.43
(2.56)
39.05
(2)
54.6
(2.31)
60.28
(2.98)
69.65
(2.95)
41.97
(2.2)
56.72
(2.5)
59.85
(3.23)
71.18
(2.64)
43.36
(2.04)
57.74
(2.19)
60.47
(3.26)
70.37
(2.75)
SVM
42.96
(1.39)
56.85
(0.76)
57.35
(1.92)
66.3
(1.65)
45.19
(1.94)
60.53
(1.52)
63.05
(2.56)
71.81
(2.33)
43.8
(1.52)
58.56
(1.46)
58.1
(2.19)
68.14
(2.12)
44.26
(1.66)
58.35
(1.25)
55.07
(1.46)
65.4
(1.87)
IJCSI International Journal of Computer Science Issues, Vol. 8, Issue 4, No 2, July 2011
ISSN (Online): 1694-0814
www.IJCSI.org
Reduction n
Class
Random
Forest
SVM
0.16
0.21
(0.03)
(0.04)
0.19
0.26
(0.03)
(0.04)
0.33
0.41
(0.04)
(0.05)
0.4
0.48
(0.05)
(0.05)
0.13
0.14
(0.03)
(0.03)
0.18
(0.03) 0.2 (0.04)
0.36
0.37
(0.04)
(0.05)
0.4
0.43
(0.05)
(0.05)
0.16
0.17
(0.03)
(0.03)
0.2
0.23
(0.03)
(0.04)
0.37
0.36
(0.05)
(0.05)
0.43
0.45
(0.05)
(0.05)
0.17
0.19
(0.03)
(0.03)
0.2
0.24
(0.03)
(0.04)
0.35
0.36
(0.05)
(0.05)
0.42
0.43
(0.05)
(0.05)
0.09
(0.02)
0.05
(0.02)
0.21
(0.03)
0.22
(0.03)
0.17
(0.03)
0.21
(0.03)
0.36
(0.05)
0.42
(0.05)
0.12
(0.02)
0.14
(0.03)
0.22
(0.04)
0.29
(0.05)
0.12
(0.03)
0.13
(0.03)
0.14
(0.03)
0.21
(0.04)
kNN
Independent
None
9
Specific
Independent
2
Specific
Independent
KDR
3
Specific
Independent
4
Specific
6
4
6
4
6
4
6
4
6
4
6
4
6
4
6
4
121
Dim
Effectiv Subject Outpu
Reductio e
t
n
Dim
Class
kNN vs
RF
kNN vs
SVM
None
KDR
RF vs
SVM
Independent
99
16
84
73
27
100
95
87
13
Specifi
c
93
52
48
88
12
94
62
38
88
12
Average 3
97
34
66
84
16
Independent
25
75
100
99
19
81
100
100
Specifi
c
28
72
14
86
22
78
28
72
13
87
22
78
Average 25
75
93
11
89
Independent
12
88
99
21
79
91
98
24
76
Specifi
c
38
62
59
41
58
42
32
68
56
44
71
29
Average 23
77
30
70
44
56
Independent
94
98
29
71
92
100
42
58
Specifi
c
25
75
69
31
80
20
31
69
68
32
81
19
Average 18
82
35
65
58
42
IJCSI International Journal of Computer Science Issues, Vol. 8, Issue 4, No 2, July 2011
ISSN (Online): 1694-0814
www.IJCSI.org
4. Conclusions
References
[1]
[2]
Fig. 4 KDR performance on subject specific data with output class = 6.
[3]
[4]
122
IJCSI International Journal of Computer Science Issues, Vol. 8, Issue 4, No 2, July 2011
ISSN (Online): 1694-0814
www.IJCSI.org
[5]
[6]
[7]
[8]
[9]
[10]
[11]
[12]
[13]
[14]
123