Computerized Heart Sounds Analysis: S.M. Debbal, F. Bereksi-Reguig

Computers in Biology and Medicine 38 (2008) 263 – 280
www.intl.elsevierhealth.com/journals/cobm
Computerized heart sounds analysis

S.M. Debbal ∗ , F. Bereksi-Reguig
Genie-Biomedical Laboratory, Department of Electronics, Faculty of Science Engineering, University A.B. Belkaid-Tlemcen, BP 119, Tlemcen 13000, Algeria
Received 23 December 2006; accepted 26 September 2007
Abstract
This paper is concerned with a synthesis study of the fast Fourier transform (FFT), the short-time Fourier transform (STFT), the Wigner
distribution (WD) and the wavelet transform (WT) in analysing the phonocardiogram signal (PCG). It is shown that these transforms provide
enough features of the PCG signals that will help clinics to obtain qualitative and quantitative measurements of the time–frequency (TF) PCG
signal characteristics and consequently aid diagnosis. Similarly, it is shown that the frequency content of such a signal can be determined by the
FFT without difficulties. The studied techniques (FT, STFT, WD, CWT, DWT and PWT) of analysis can thus be regarded as complementary
in the TF analysis of the PCG signal; each will relate to a part distinct from the analysis in question.
䉷 2007 Elsevier Ltd. All rights reserved.
Keywords: Phonocardiogram; Signal processing; Sounds; Time–frequency analysis; Signal analysis; FFT; STFT; WD; WT
1. Introduction normal cardiac cycle. Heartbeat sound analysis by auscultation

only is still insufficient to diagnose some heart diseases. It does
Heartbeat sound analysis by auscultation is still insufficient not enable the analyst to obtain both qualitative and quantitative
to diagnose some heart diseases. It does not enable the ana- characteristics of S1 and S2 of the PCG [1,2]. Moreover, in
lyst to obtain both qualitative and quantitative characteristics of studying the physical characteristics of heart sounds and human
the phonocardiogram (PCG) signals [1,2]. Abnormal heartbeat hearing, it is seen that the human ear is poorly suited to cardiac
sounds may contain, in addition to the first and second sounds, auscultation [3]. Therefore, clinic capabilities to diagnose heart
S1 and S2, murmurs and aberrations caused by different patho- sounds are limited. In this paper we are interested in the study
logical conditions of the cardiovascular system [2]. Moreover, of the physical characteristics of the second heart sound S2
in studying the physical characteristics of heart sounds and hu- which consists of two major components in the spectrum of
man hearing, it is seen that the human ear is poorly suited for the signal. One of these components A2 is due to the closure
cardiac auscultation [3]. Therefore, clinic capabilities to diag- of the aortic valve and the other P2 is due to the closure of the
nose heart sounds are limited. pulmonary valve.
The sound emitted by a human heart during a single cardiac The aortic component is louder than the pulmonic compo-
cycle consists of two dominant events, known as the first heart nent. It is discernible at all the auscultation sites. It is best heard
sound S1 and the second heart sound S2 (Fig. 1), S1 relates to at the right base, with the diaphragm of the chest piece firmly
the closing of the mitral and tricuspid valves whilst S2 is gen- pressed, whereas the pulmonic one may only be audible at the
erated by the halting of the aortic and pulmonary valves leaflets left base, with the diaphragm of the chest piece firmly pressed.
[1]. S1 corresponds in timing to the QRS complex in ECG The aortic component has higher frequency contents and
(Electrocardiogram) and S2 follows the systolic pause in the generally precedes the pulmonary component because in nor-
mal heart activity the aortic valve closes before the pulmonary
valve. The difference in time occurrence between these valves’
∗ Corresponding author. Tel.: +213 4327391. activities is known in the medical community [4–6] as split.
E-mail addresses: adebbal@yahoo.fr (S.M. Debbal), bereksif@yahoo.fr However, in many diseases [1] this order of time occurrence
(F. Bereksi-Reguig). may be reverse and its delay varies.
0010-4825/$ - see front matter 䉷 2007 Elsevier Ltd. All rights reserved.
doi:10.1016/j.compbiomed.2007.09.006
264 S.M. Debbal, F. Bereksi-Reguig / Computers in Biology and Medicine 38 (2008) 263 – 280
cross-terms is by smoothing the TF plane, but this will be at the

expense of decreased resolution in both time and frequency [7].
The WD was applied to heart sound signal and it shows no
success in displaying or separating the signal components in
both time and frequency directions [6], although it provides
high time and frequency resolution in simple monocomponent
signal analysis [8].
To overcome these difficulties with the STFT and the WD
an alternative way to analyse the non-stationary signals is the
WT. It expands the signal to some basis functions. The basis
functions can be constructed by dilation, contractions and shifts
of a unique function called the “wavelet prototype” or “wavelet
mother”. The WT acts as “mathematical microscope” in which
we can observe different parts of the signal by just adjusting
the focus.
The WT is a technique in the domain of TF distributions. The
Fig. 1. Correlation between the phonocardiogram signal (PCG) and the elec- main idea of this method is the representation of an arbitrary
trocardiogram signal (ECG). signal as a superposition of basic signals, “atoms”, located in
time and frequency. These atoms may be derived by means of
a special operation on a single parent atom. Parent atoms and
The characteristics of the PCG signal and other features such derivation operation are usually chosen such as to enable the
as heart sounds S1 and S2 location, the number of components construction of an orthonormal system [9].
for each sound, their frequency content, their time interval, all The study of the decomposition of the signal in “atoms” was
can be measured more accurately by digital signal processing first carried out by Gabor; however, it was quickly abandoned
techniques. because of:
The fast fourier transform (FFT) can provide a basic un-
1. the no simultaneous representation in time and frequency;
derstanding of the frequency contents of the heart sounds.
2. grid made up of rectangular cells is not a flexible device;
However, FFT analysis remains of limited values if the station-
3. the mathematical theory of the phenomenon is badly struc-
ary assumption of the signal is violated. Since heart sounds
tured.
exhibit marked changes with time and frequency, they are
therefore classified as non-stationary signals. To understand The representation time scale of WT based on a dyadic
the exact feature of such signals, it is thus important to study paving appears more flexible. It is a mathematical structure
their time–frequency (TF) characteristics. governed by a formula of exact inversion [10] making possible
In this paper the wavelet transform (WT) is used to analyse the existence of orthonormal basis. This makes the wavelet to
both normal and abnormal heart sounds in both time and fre- be a simultaneous function of time and frequency.
quency domains. This technique has been shown to have a very In this paper the continuous wavelet transform (CWT) is
good time resolution for high-frequency components. In fact, applied to analyse pathological PCG signals. The CWT is more
the time resolution increases as the frequency increases and the appropriate than the discrete wavelet transform (DWT), since
frequency resolution increases as the frequency decreases [4,5]. we are interested in the analysis of non-stationary signals and
Furthermore, the WT has demonstrated the ability to analyse not signal coding where DWT is found to be more useful.
the heart sound more accurately than other techniques such
as short-time Fourier transform (STFT) or Wigner distribution 2. Theoretical background
(WD) [6] in some pathological cases.
In fact, the spectrogram (STFT) cannot track very sensitive 2.1. Fourier transform (FT)
sudden changes in the time direction. To deal with these time
changes properly, it is necessary to keep the length of the time In 1882, Joseph Fourier discovered that any periodic func-
window as short as possible. This, however, will reduce the tion could be represented as an infinite sum of periodic com-
frequency resolution in the TF plane. Hence, there is a trade-off plex exponential functions [10]. The inclusive property of only
between time and frequency resolutions [6]. periodic functions was later extended to any discrete-time func-
The WD has shown good performances in the analysis of tion. The Fourier transform (FT) (as regular Fourier transform)
non-stationary signals. This comes from the ability of the WD converts a signal expressed in the time domain to a signal ex-
to separate signals in both time and frequency directions. One pressed in the frequency domain. The FT is widely used and
advantage of the WD over the STFT is that it does not suffer usually implemented in the form of FFT algorithm. The math-
from the TF trade-off problem. On the other hand, the WD ematical definition of FT is given below:
has a disadvantage since it shows cross-terms in its response.
These cross-terms are due to the non-linear behaviour of the X(f ) = x(t)e−j2f t dt, (1)
WD, and bear no physical meaning. One way to remove these
S.M. Debbal, F. Bereksi-Reguig / Computers in Biology and Medicine 38 (2008) 263 – 280 265
where t and f are, respectively, the time and frequency param- The auto WD is obtained when x(t) = y(t) = s(t); it is a
eters. The time domain signal x(t) is multiplied by a complex bilinear function of the signal s(t), the WD, and can also be
exponential at a frequency f and integrates over all time. In expressed as
other words, any discrete time signal may be represented by +∞
a sum of sines and cosines, which are shifted and are multi- ∗ −j
S(t, ) = s t+ s t− e d, (4)
plied by a coefficient that changes their amplitude. X(f ) are −∞ 2 2
the Fourier coefficients which are large when a signal contains
where t and are, respectively, the time and frequency vari-
a frequency component around the frequency f.
ables, and ∗ denotes the complex conjugate.
The peaks in a plot of FT of a signal correspond to dominant
The WD had shown good applications in the analysis of
frequency components of the signal. Fourier analysis is sim-
non-stationary signals [7]. This comes from the ability of this
ply not effective when used on non-stationary signals because
method to separate signals in both time and frequency direc-
it does not provide frequency content information localized in
tions. The WD has a disadvantage that it is limited by the
time. Most real world signals exhibit non-stationary character-
appearance of cross-terms. These cross-terms are due to the
istics (such as heart sound signals). Thus, Fourier analysis is
non-linear behaviour of the WD, and bear no physical mean-
not adequate.
ing. One way to remove these cross-terms is by smoothing the
TF plane [7], but this will be at the expense of decreased res-
2.2. Short-time Fourier transform (STFT) olution in both time and frequency.
STFT is obtained by calculating the FT of a sliding windowed 2.4. Continuous wavelet transform (CWT)
version of the time signal s(t). The location of the sliding
window adds a time dimension and one gets a time-varying CWT was developed as a method to obtain simultaneous,
frequency analysis. high-resolution time and frequency information about a signal.
The mathematical representation of STFT is The term ‘wavelet’ was first mentioned in 1909 in a thesis by
+∞
Alfred Haar (M. Misiti, Y. Misiti, G. Oppenheim, J.-M. Poggi,
S(t, f ) = s()w( − t)e−j2f t d, (2) Wavelet Toolbox: for use with MATLAB. The Math Works Inc.
−∞ 1996), although the progress in the field of wavelet has been
relatively slow until the 1980s when scientists and engineers
where w( − t) is the sliding window applied to the signal s(t), from different fields realized they were working on the same
f is the frequency and t is the time. concept and began collaborating.
The length of the window is chosen to maintain signal sta- CWT rather than STFT uses a variable sized window region.
tionary in order to calculate the FT. To reduce the effect of Because the wavelet may be dilated or compressed, different
leakage (the effect of having finite duration), each sub-record features of the signal are extracted. While a narrow wavelet
is then multiplied by an appropriate window and then the FT is extracts high-frequency components, a stretched wavelet picks
applied to each sub-record. As long as each sub-record does not up on the lower frequency components of the signal.
contain rapid changes the spectrogram will give an excellent CWT is computed by correlating the signal s(t) with families
idea of how the spectral composition of the signal has changed of TF atoms g(t); it produces a set of coefficients C(a, b)
during the whole time record. However, there exist many phys- given by
ical signals whose spectral content is so rapidly changing that +∞
finding an appropriate short-time window is problematic, since 1 ∗ t −b
C(a, b) = √ s(t)g dt (5)
there may not be any time interval for which the signal is sta- a −∞ a
tionary. To deal with these time changes properly, it is neces- +∞
sary to keep the length of the time window as short as possible. √
= a G(aw)S(w)ejbw dw, (6)
This, however, will reduce the frequency resolution in the TF −∞
plane. Hence, there is a trade-off between time and frequency
where
resolutions.
• b is the time location;
2.3. WD function • a is called the scale factor and it is inversely proportional to
the frequency (a > 0);
In contrast to STFT, which is resolution limited either in time • * dénotes a complex conjugate;
or in frequency (dictated by the window function), and suffers • g(t) is the analysing wavelet;
from smearing and side lobe leakage, the WD offers excellent • S(w) and G(w) are, respectively, the FTs of s(t) and g(t).
resolution in both the frequency and time domains. The WD of
two signals, x(t), y(t), is defined via The analysing wavelet function g(t) should satisfy some
properties. The most important ones are continuity, integrabil-
+∞ ∗ −j
W (t, ) = x t+ y t− e d. (3) ity, square integrability, progressivity and it has no d.c. compo-
−∞ 2 2 nent. Moreover, the wavelet g(t) has to be concentrated in both
time and frequency as much as possible. It is well known that (decimation) according to the rule of Nyquist. This leads to a
the smallest time–bandwidth product is achieved by the Gaus- considerable reduction in computing time. Every approxima-
sian function [3]. Hence, the most suitable analysing wavelet tion is decomposed all over again in approximation and detail.
for TF analysis is the complex exponential modulated Gaussian In this case the signal is decomposed into several frequency
function of the form bands instead of two bands. The number of bands depends on
2 the decomposition level.
t
g(t) = exp − + jw0 t . (7) Fig. 2 shows the discrete wavelet decomposition on three
2 levels, as well as the note associated filter bank for each level.
If we choose the analysing wavelet that has the following FT: We note that the width of the filter banks bandpass is verita-
ble according to the decomposition level. The wavelet analy-
G(w) = A exp[−(w − w0 )2 /2] + ε, (8) sis permits to decompose the PCG signal in filter banks. The
signal to be analysed is decomposed in approximation and in
where ε is a small correction term, theoretically necessary to detail while using two filters h and g. The h filter is a low-pass
satisfy the admissibility conditions of wavelets, w0 is chosen filter with a bandpass [0, /2]; it generates the approximation
large enough so that the correction term is negligible and can signal “A”.
be ignored.
This is known as the Gabor wavelet. It was shown [5] that
2.6. Wavelet packet transform (WPT)
w0 = 5.33, which is enough to make the correction term neg-
ligible and gives an optimal time–bandwidth product.
Wavelet packet analysis is an extension of DWT and it turns
In a CWT, the wavelet corresponding to the scale and the
out that DWT is only one of the many possible decompositions
time location b is given by
that could be performed on the signal. Instead of just decom-

t −b posing the low frequency component, it is therefore possible
ga,b (t) = 1/ |a|g , (9) to subdivide the whole TF plane into different TF pieces as
a
can be seen from. The advantage of wavelet packet analysis is
where g(t) is the wavelet “prototype” or mother which can be that it is possible to combine the different levels of decompo-
thought of as a bandpass function. The factor /a/−1/2 is used sition in order to achieve the optimum TF representation of the
to ensure energy preservation [5]. original [13].
To obtain a uniform filter bank we will use the wavelet packet
2.5. Discrete wavelet transform (DWT) analysis which is a generalization of the DWT analysis. The
decomposition is also made of the detail signal “D”. Fig. 3
Wavelet series coefficients are sampled CWT coefficients. shows the wavelet packet decomposition on three levels as well
Time remains continuous but time scale parameters (b, a) are as the filter banks associated for each uniform level.
sampled on a so-called “dyadic” grid in the time scale plane
(b, a) [11]. A common definition is 3. Results and discussion
Cj k = CWT < s(t); a = 2j , b = k2j > with j, k ∈ Z. (10) 3.1. FFT application
In this case the wavelets are An FFT algorithm is first applied to the PCG signal given in
−j/2 −j Fig. 1. The frequency spectrum illustrated in Fig. 4(a) shows
j k (t) = 2 (2 t − k). (11)
that the normal PCG signal has a frequency content varying
The DWT has been recognized as a natural WT for discrete- from around 40 Hz up to 200 Hz. The FFT can be applied to
time signals. Both the time and time-scale parameters are the first part of this signal to analyse the frequency content of
discrete. S1 as shown in Fig. 4(b) and then to the second half to analyse
The discretization process partially depends upon the algo- the frequency content of S2 as shown in Fig. 4(c). A 512 points
rithm chosen to perform the transformation. FFT is applied to S1 and S2. At this stage the sound S1 or S2
Cj,k could be well approximated by digital filter banks. By cannot be separated.
using Mallat’s [12] remarkable fast pyramid algorithms which In fact, the application of FFT on heart sounds S1 and S2
involve use of low-pass and high-pass filters. The Mallat algo- after their separation or identification shows that the basic fre-
rithm is in fact a classical scheme known in the signal process- quency components are obviously detected by FT.
ing community as a two-channel subband coder. The wavelet The spectrum of S1 has reasonable values in the range
analysis permits to decompose the PCG signal in filter banks. 10–180 Hz. The spectrum is distinctly resolved in time into
The signal to be analysed is decomposed in approximation and two majors components (M1 and T 1) while most of the energy
in detail while using two filters h and g. The h filter is a low- of these sounds appears to be concentrated.
pass filter with a bandpass [0, /2]; it generates the approxi- The two components A2 (due to the closure of the aortic
mation signal “A”. valve) and P2 (due to the closure of the pulmonary valve) of
The g filter is a high–pass filter of bandpass [0, ], it gener- the second sound S2 are obvious in Fig. 4(c). The spectrum
ates the detail signal “D”. The filtered signals are under-sampled of the sound S2 has reasonable values in the range 50–250 Hz.
Fig. 2. (a) Example of the discrete wavelet decompostion for three levels and (b) filter banks of the decomposition of each level.
Fig. 3. (a) Example of the packet wavelet decompostion for three levels and (b) filter banks of the decomposition of each level.
The spectrum for this sound is distinctly resolved in time into the “split” which separates them. For a normal heart activity
two majors components (A2 and P2) as shown in Fig. 4(c). usually A2 precedes P2 and the value of the split is lower than
However, the FFT analysis of S2 cannot tell which of A2 and 30 ms. This time delay between A2 and P2 is very important
P2 precedes the other, or the value of the time delay known as to detect some pathological cases. The sound S2 seems to have
x 10-3
1.5 2
frequency spectrum for
normalized amplitude
normal cardiacs sounds
1 the normal cardiacs sounds
1.5
fft magnitude
S2
0.5 S1
0 1
-0.5
0.5
-1
-1.5 0
1000 2000 3000 4000 5000 100 200 300 400 500
number of samples fréquency(Hz)
x 10-3
1.5 4
Sound S1
1 frequency spectrum
3
fft magnitude
for the sound S1
0.5
0 2
-0.5
1
-1
-1.5 0
500 1000 1500 2000 100 200 300 400 500
number of samples fréquency (Hz)
1.5 0.2
frequency spectrum
1 Sound S2
for the sound S2
0.15
fft magnitude
0.5
0 0.1
P2 A2
-0.5
0.05
-1
-1.5 0
3000 3500 4000 4500 100 200 300 400 500
number of samples Frequency (Hz)
Fig. 4. Frequency spectrum for the normal cardiacs sounds and the sounds S1 and S2.
higher frequency content than that of S1 as shown in Fig. 4(b) there are many more frequency components in this range. On
and (c). the other hand, the mitral stenosi is rather a severe attack of
The FFT is also applied to analyse fourth PCG signals, one the mitral valves involving a presystolic reinforcement as well
normal and three different marked pathological cases (the aortic as a bursting of the sound S1.
insufficiency (AI), the aortic stenosis and the mitral stenosis). As the frequency extent of the sound S1 is less important
These are illustrated in Fig. 5 along with the normal PCG signal. than that of the sound S2, the spectral response of the PCG
The basic frequency content is obviously different from that signal related to this pathological case is not much affected
of the normal PCG signal. It is clearly shown that there is compared to that of the normal case as was the case in aortic
great loss of frequency component in each of the pathological insufficiency and aortic stenosis.
cases with respect to the normal case. In addition, except the In conclusion, and by applying the spectral analysis to dif-
AI case where we note the apparition of frequency component ferent PCG signals, we can affirm which of the sounds S1 or
higher than 200 Hz, the other cases (mitral-stenosis and aortic- S2 is directly concerned by the pathology, and more precisely,
stenosis) present a frequency spectrum limited to 200 Hz. which component of these sounds is affected.
AI and aortic stenosis are two pathological cases resulting With regard to normal PCG the basic frequency components
from a severe organic attack, which generally involves a disap- are obviously detected by the FFT but not the time delay be-
pearance of the aortic component A2 of the sound S2. This is tween these components. In fact as it was shown for example
shown in their corresponding PCG frequency responses illus- in Fig. 4(c), the components A2 and P2 of the second sound S2
trated in Fig. 5, where we notice a lack of frequency contents are obvious. However, the FFT analysis of S2 cannot tell what
in the range under 100 Hz compared to the normal case, where is the value of the time delay between A2 and P2. It is thus
x 10-3
1.5 2
normal cardiacs sounds frequency spectrum
1 for the normal
fft magnitude
1.5 cardiacs sounds
0.5
0 1
-0.5
0.5
-1
-1.5 0
0 2000 4000 6000 100 200 300 400 500
number of samples fréquency (Hz)
1.5 15
aortic-stenosis frequency spectrum

1 for the aortic-stenosis
fft magnitude
0.5 10
0
-0.5 5
-1
-1.5 0
0 2000 4000 6000 100 200 300 400 500
number of samples frequency (Hz)
1.5 15
aortic-insuficiency frequency spectrum

1 for the aortic-insuficiency
fft magnitude
0.5 10
0
-0.5 5
-1
-1.5 0
0 2000 4000 6000 8000 100 200 300 400 500
nuber of samples frequency (Hz)
1.5 1.5
mitral-stenosis frequency spectrum

1 for the mitral-stenosis
fft magnitude
0.5 1
0
-0.5 0.5
-1
-1.5 0
0 2000 4000 6000 8000 100 200 300 400 500
number of samples frequency (Hz)
Fig. 5. Normal and abnormals cardiac sounds and their frequency spectrum, respectively.
essential to look for a transform which will describe a kind of Figs. 6(b) and 7(b) provide, respectively, the STFT applica-
“time-varying” spectrum. tion of the normal and the CA case. From these figures we can
see the difference of the TF features between them. For the
3.2. STFT application two cases (N and CA), the second sound (S2) is shown to have
The normal PCG signal (Fig. 6(a)) and the coarctation of the higher frequency content than the first sound (S1) [14,15]. This
aorta (Fig. 6(a)) are analysed in this section. The coarctation is expected since the amount of blood present in the cardiac
of the aorta has been deliberately chosen here to evaluate the chambers is lesser [16].
performance of the STFT analysis for it is very similar to the We consider here two examples of the PCG signals with
normal case. Thus, Fig. 6(a) illustrates such a signal where we murmur: the pulmonary stenosis (PS: Fig. 8(a)) with a systolic
can notice that the temporal representation is almost similar to murmur and the aortic regurgitation (AR: Fig. 9(a)) with a
that of normal case. diastolic murmur.
0.4
N
S1 S2 S1
0.2
amplitude
0
-0.2 Systole Diastole
-0.4
0 0.5 1 1.5 2 2.5
samples
x 104
600
500
frequency (Hz)
400
300 S1 S2 S1
200
100
0 0.5 1 1.5 2 2.5 3

samples
Fig. 6. The STFT application of the normal PCG (N): (a) the normal PCG signal (N) and (b) the spectrogram of the normal signal (N).
0.4
CA
S1 S2 S1
0.2
amplitude
-0.2 Systole Diastole
-0.4
0 0.5 1 1.5 2 2.5
samples
x 104
600
500
frequency (Hz)
400 S1 S2 S1
300
200
100
0 0.5 1 1.5 2 2.5 3

samples
Fig. 7. The STFT application of the coarctation of the aorta (CA): (a) the coarctation of the aorta signal (CA) and (b) the spectrogram of the signal CA.
Figs. 8(b) and 9(b) show, respectively, the STFT application Group 1: normal (N) or similar morphological signal (CA);
of the PS and AR signals. We can notice that the frequency Group 2: opening snap (OS) and ejection click (EC);
extent of the diastolic murmur of the AR case is higher (about Group 3: PCG signal with width murmur (PS and AR).
600 Hz) than the systolic murmur of the PS case (about 400 Hz).
In this section we present the experimental results of the Fig. 10 provides a better representation of the results ob-
short TF transform application of the three followings groups tained concerning the frequency contents of the sounds and
of the PCG signals used. murmurs analysed. Under normal conditions (N) or in the
a
1
systolic PS
0.5 murmur
amplitude
0
-0.5
S1 S2
-1
0 0.5 1 1.5 2 2.5
samples
b x 104
600
500 systolic
frequency (Hz)
murmur
400
300
200
100 S1 S2
0 0.5 1 1.5 2 2.5 3

samples
Fig. 8. The STFT application of the pulmonary stenosis: (a) the pulmonary stenosis signal (PS) and (b) the spectrogram of the signal PS.
0.4
diastolic AR
murmur
0.2
amplitude
-0.2
S2 S1 S2
-0.4
0 0.5 1 1.5 2 2.5
samples
x 104
800
diastolic
frequency (Hz)
600 murmur
400
200 S2 S1 S2
0 0.5 1 1.5 2 2.5 3

samples
Fig. 9. The STFT application of the aortic of the regurgitation: (a) the aortic of the regurgitation signal (AR) and (b) the spectrogram of the signal AR.
presence of similar signals (CA), the frequency content of the Finally, in fact the width murmurs (PS and AR cases) present
sound S2 is more significant than that of sound S1. a frequency extent that is very significant. Discrimination be-
We noted that the light murmurs (OS, EC) can influence tween the systolic and diastolic murmurs can be made starting
the TF content of the principal sounds S1 and S2 and have a from this frequency extent, diastolic murmurs, thus having a
frequency extent more significant than them. frequency extent more significant than the systolic murmurs.
800 In Fig. 12(d) we can see that S1 is clearly resolved into two
major components (M1 and T1). In Fig. 12(f) the sound S2 is
700 PCG signals
also resolved into two major components (A2 and P2). The time
with murmur
AR
600
delay between A2 and P2 can be easily measured with the use
of the wavelet coefficients (Fig. 12(f)). This delay is measured
Frequency extent
500 to be 13 ms. It is smaller than the 30 ms [17] as foreseen in the

PCG signals
with click PS normal conditions of the PCG signal. Pathological conditions
400 First and second could cause this time difference to narrow or widen. Moreover,
heart sounds OS
300 the order of occurrence of A2 and P2 may be reversed. The WT
EC
S2
allows measurement and determination of this time difference,
200 and thus allows a diagnosis process regarding this important
S1
parameter to be produced.
100
Table 1 resumes the differences observed between the com-
0 ponents A2 and P2.
0 1 2 3 4 5 6 7 It can be concluded for the normal PCG that
Fig. 10. Frequency extent of the three groups of PCG signals.
• The component A2 precedes in time the component P2.
• The component A2 has a higher frequency content than P2.
• The amplitude of A2 is more important than that of P2.
In many cases these murmurs seem not to affect the TF content
of the sounds S1 and S2 too much?.
These parameters, particularly the frequency, make it possible
to see a difference between A2 and P2.
3.3. WD application Moreover, the ability of WT in heart disease diagnosis can
be studied by applying the CWT algorithm on different marked
Fig. 11 shows the WD application of the second cardiac cases. The results of this application are illustrated in Fig. 13(b)
sound of the normal case ((a)), the coarctation of the aorta (AI), Fig. 13(c) (aortic stenosis) Fig. 13(d) (mitral stenosis).
case ((b)) and the innocent murmur case ((c)). One can notice The coefficients of CWT allow us to clearly discern the fre-
here that the two principal components (A2 and P2) start to quency range of each signal. It also shows the major compo-
appear in the presence of cross-terms. The WD results may be nents according to temporary variation; the maximal ampli-
improved by increasing the sampling rate of original signal, tude is characterized by a darker colour than those of the small
but it still suffers from the cross-terms problem because of the amplitudes.
non-linearity of the WD.
However, the WD has shown good performances in the anal-
ysis of non-stationary signals. This comes from the ability of 3.5. DWT application
the WD to separate signals in both time and frequency direc-
tions. One advantage of the WD over the STFT is that it does The multiresolution analysis based on DWT is a power-
not suffer from the TF trade-off problem. On the other hand, ful tool in filtering, separating and identification of the inter-
the WD has a disadvantage since it shows cross-terms in its re- nal components and murmurs of the various analysed signals
sponse. These cross-terms are due to the non-linear behaviour (Fig. 14).
of the WD, and bear no physical meaning. One way to re- Fig. 15(a) shows the application of the DWT of one cycle of
move these cross-terms is by smoothing the TF plane, but this the normal PCG. Levels d1 and d2 represent the high-frequency
will be at the expense of decreased resolution in both time and variations of the baseline of PCG signal. Levels d3 and d4 em-
frequency. phasize clearly the side of high-frequency content of the sound
S2 compared to S1. The component A2 (the most predomi-
nant in sound S2) appears better on levels d5 and d6. Level
3.4. CWT application
d7 represents the two principal components of the sound S1
(M1, T1).
An algorithm under MATLAB environment of the CWT is
The same analysis by using the DWT concerning the coarc-
developed and then applied to analyse the different PCG sig-
tation of the aorta and the mitral stenosis is also applied
nals. First, the CWT application of the normal PCG is studied
(Fig. 16(b) and (c)). Fig. 13(b), in level d7, provides a good
as illustrated by Fig. 12(a). The contour plot given by Fig. 12(b)
representation of the third component “C” adding a the two
gives the result more clearly. The two heart sounds are clearly
major components of the sound S2 (A2 and P2).
shown in dark colour. The CWTs of S1 and S2 are also dis-
played separately in Fig. 12(c) and (e), respectively, and their
contour plot, respectively, by Fig. 12(d) and (f). The sound S2 3.6. Packet WT application
is shown to have higher frequency content than that of S1. This
is expected since the amount of blood present in the cardiac The use of the wavelet packet transform (WPT) is exactly
chambers is lesser [1,16]. the same as those developed in the DWT. The only difference
CWS s=0.01
350
Sound S2: N
300 Cross-terms
time in samples
250
200
150
A2
100
P2
50
0
0 0.005 0.01 0.015 0.02 0.025 0.03 0.035 0.04
Frequency (x10 KHz)
CWS s=0.01
350
Sound S2 : CA
300
250
time in samples
200
150
100
50
0
0 0.005 0.01 0.015 0.02 0.025 0.03 0.035 0.04
Frequency (x10 KHz)
CWS s=0.01
450
Sound S2 : IM
400
350
time in samples
300
250
200
150
100
50
0
0 0.005 0.01 0.015 0.02 0.025 0.03 0.035 0.04
Frequency (x10 KHz)
Fig. 11. The Wigner distribution of the second cardiac sound S2: (a) the normal case; (b) the coarctation of the aorta case (CA); and (c) the innocent murmur
case (IM).
is that wavelet packets offer a more complex and flexible anal- approximation and a detail. The approximation is then itself
ysis, because in wavelet packet analysis, the details as well split into a second-level approximation and detail, and the pro-
as the approximations are split (Fig. 3). Single wavelet packet cess is repeated.
decomposition gives a lot of bases from which you can look The WPT analysis in this paper gives important features of
for the best representation with respect to a design objective. the extent of frequency of the heart sounds (S1 or S2) and car-
The wavelet packet method is thus a generalization of wavelet diac murmur. These features can help clinicians in their diag-
decomposition that offers a richer range of possibilities for nosis or in recognizing pathological conditions concerning the
signal analysis. In wavelet analysis, a signal is split into an recording PCG signals.
Fig. 12. (a) CWT application of the normal PCG; (b) contour plot of the surface in (a); (c) CWT application of the sound S1; (d) contour plot of the surface
in (c); (e) CWT application of the sound S2; and (f) contour plot of the surface in (e).
Table 1
Temporal and frequential measurements related to the components A2 and P2
Localisation of the delay “d” (ms) Minimal frequency (in scale) Maximal frequency (in scale) Frequency extent
A2 13 19 124 105
P2 18 116 98
Fig. 13. Continuous wavelet analysis (CWT) for the normal PCG and abnormal PCG.
0.2
amplitude
S1 S2 N
0
-0.2
0 500 1000 1500 2000 2500 3000 3500 4000 4500 5000
0.5
amplitude
S1 CA
S2
0
-0.5
0 500 1000 1500 2000 2500 3000 3500 4000 4500 5000
0.5
amplitude
MS
0
-0.5
0 1000 2000 3000 4000 5000 6000 7000
0.5
amplitude
AR
0
-0.5
0 1000 2000 3000 4000 5000 6000
1
amplitude
DR
0
-1
0 1000 2000 3000 4000 5000 6000
0.5
amplitude
AI
0
-0.5
0 1000 2000 3000 4000 5000 6000
time (samples)
Fig. 14. PCG signals used: (a) normùal (N); (b) the coarctation of the aorta (CA); (c) the mitral stenosis (MS); (d) the aortic regurgitation (AR); (e) the
diastolic ruble (DR); and (f) the aortic insufficiency (AI).
Fig. 16 provides a TF representation of one cardiac cy- frequency as time progresses. It was shown that basic fre-
cle of the heart sounds concerning the normal PCG signal quency content of PCG signal can be easily provided using
(Fig. 16(a)), the coarctation of the aorta case (Fig. 16(b)) and the FFT technique. However, time duration and transient
the mitral stenosis case (Fig. 16(c)). These figures show the variation cannot be resolved; the WT therefore is a suit-
frequency range of each component or murmur of the PCG sig- able technique to analyse such a signal. It was also shown
nal studied. Thus, we can observe the component “C” adding that the coefficients of the continuous wavelet transform
of the two major components for the sound S2 (A2 and P2). (CWT) give a good graphic representation that provides a
The same results have been found in Section 3.5 (Fig. 14(b), quantitative analysis simultaneously in time and frequency.
detail d7). It is therefore very helpful in extracting clinically useful
Fig. 17 has the advantage of presenting at the same time information.
the frequency extent of the various components of the cardiac The measurement of the time difference between the A2
sound such as their frequency sites compared to one another. and P2 components in sound S2, the number of major com-
We can show clearly that the diastolic ruble case has a high- ponents of sounds S1 and S2 and the frequency range and
frequency content than the AI case or the AR case. duration for all these components and sounds can be ac-
curately achieved for CWT simultaneously as was clearly
4. Conclusion illustrated.
It is found that WT is capable of detecting the four ma-
The cardiac (heartbeat sound) cycle of phonocardiogram jor components of the first sound S1 and the two compo-
(PCG) is characterized by transients and fast changes in nents (the aortic valve component A2 and the pulmonary
normal PCG
0.2
amplitude
S1 S2
0
-0.2
0 1000 2000 3000 4000 5000 6000
0.02
M1 T1 tim e(sam ples ) A2 P2
d7
0
.02
-0
0 1000 2000 3000 4000 5000 6000
0.1
tim e(sam ples )
d6
0
-0.1
0 1000 2000 3000 4000 5000 6000
0.2
tim e(sam ples )
d5
-0.2
0 1000 2000 3000 4000 5000 6000
0.1
tim e(sam ples )
d4
0
-0.1
0 1000 2000 3000 4000 5000 6000
0.02
tim e(sam ples ) S2
d3
0
-0.02
0 1000 2000 3000 4000 5000 6000
0.01
tim e(sam ples )
d2
0
-0.01
0 1000 2000 3000 4000 5000 6000
0.01
tim e(sam ples )
d1
0
-0.01
0 1000 2000 3000 4000 5000 6000
time (samples)
coarctation of the aorta

amplitude
0.5
0 S1 S2
-0.5
0.02 0 1000 2000 3000 4000 5000 6000
C
d7
0
-0.02
0.2 0 1000 2000 3000 4000 5000 6000
d6
0
-0.2
0.2 0 1000 2000 3000 4000 5000 6000
d5
0
-0.2
0.2 0 1000 2000 3000 4000 5000 6000
d4
0
-0.2
0.1 0 1000 2000 3000 4000 5000 6000
d3
0 S2
-0.1
0.02 0 1000 2000 3000 4000 5000 6000
d2
0
-0.02
0.01 0 1000 2000 3000 4000 5000 6000
d1
0
-0.01
0 1000 2000 3000 4000 5000 6000
time (samples)
mitral stenosis
0.5
amplitude
0 S1 + murmur COM S2
-0.5
0.05 0 1000 2000 3000 4000 5000 6000 7000 8000
d7
-0.05 S1
0.2 0 1000 2000 3000 4000 5000 6000 7000 8000
A2 P2
d6
0
-0.2
0.2
0 1000 2000 3000 4000 5000 6000 7000 8000
d5
-0.2
0.1 0 1000 2000 3000 4000 5000 6000 7000 8000
d4
-0.1
0.05 0 1000 2000 3000 4000 5000 6000 7000 8000
d3
0 S1 S2
-0.05
0.01 0 1000 2000 3000 4000 5000 6000 7000 8000
d2
-0.01
0.01
0 1000 2000 3000 4000 5000 6000 7000 8000
d1
0
-0.01
0 1000 2000 3000 4000 5000 6000 7000 8000

time (samples)
Fig. 15. Discrete wavelet transform (DWT) analysis for: (a) the normal PCG; (b) the coarctation of the aorta; and (c) the mitral stenosis.
normal PCG
62
61
60
59
58
57
56
55
54
53
52
51
50
49
48
scales
47
46
45
44
43
42
41 S1
40
39 S2
38
37
36
35
34
33
32
31
500 1000 1500 2000 2500 3000 3500 4000 4500 5000 5500 6000
time (samples)
coarctation of the aorta
62
61
60
59
58
57
56
55
54
53
52
51
50
49
48
scales
47
46
45
44
43
42 S2
41
40 S1
39
38
37
36
35 c
34
33
32
31
500 1000 1500 2000 2500 3000 3500 4000 4500 5000 5500 6000
time (samples)
mitral stenosis
62
61
60
59
58
57
56
55
54
53
52
51
50
49
48
scales
47
46
45
44
43
42 S1 + murmur
41
S2
40
39
38
37
36
35
34
33
32
31
1000 2000 3000 4000 5000 6000 7000 8000 9000 10000
time (samples)
Fig. 16. Wavelet packet transform (WPT) analysis for: (a) the normal PCG; (b) the coarctation of the aorta; and (c) the mitral stenosis.
valve component P2) of the second sound S2 of a nor- FFT can display the frequencies of the components A2
mal PCG signal. These components are not accurately and P2 but cannot display the time delay between
detectable using STFT or WD. However, the standard them.
aortic insufficiency
62
61
60
59
58
57
56
55
54
53
52
51
50
49
48
scales
47
46
45
44
43
42
41
40
39
38
37
36
35
34
33
32
31
1000 2000 3000 4000 5000 6000 7000 8000
time (samples)
aortic regurgitation
62
61
60
59
58
57
56
55
54
53
52
51
50
49
48
scales
47
46
45
44
43
42
41
40
39
38
37
36
35
34
33
32
31
500 1000 1500 2000 2500 3000 3500 4000 4500 5000 5500 6000
time (samples)
diastolic rumble
62
61
60
59
58
57
56
55
54
53
52
51
50
49
48
scales
47
46
45
44
43
42
41
40
39
38
37
36
35
34
33
32
31
500 1000 1500 2000 2500 3000 3500 4000 4500 5000 5500 6000
time (samples)
Fig. 17. Wavelet packet transform (WPT) analysis for: (a) the aortic insufficiency (Ai); (b) the aortic regurgitation (AR); and (c) the diastolic rumble (DR).
The application of STFT in the analysis of the PCG signals murmurs (OS, EC) can influence the TF content of the principal
made it possible to obtain appreciable information on the TF sounds S1 and S2 and have a frequency extent more significant
content of the sounds S1, S2 and of the added murmurs (OS, than them.
EC or width murmurs). Finally, in fact the width murmurs (PS and AR cases) present
Under the normal conditions (N) or in the presence of similar a frequency extent that is very significant. Discrimination be-
signals (CA), the frequency content of the sound S2 is more tween the systolic and diastolic murmurs can be made starting
significant than that of the sound S1. We noted that the light from this frequency extent, diastolic murmurs thus having a
frequency extent more significant than those of systolic mur- [10] A. Harris, G. Sutton, M. Towers, Physiological and Clinical Aspects of
murs. In many cases these murmurs seem not to affect the TF Cardiac Auscultation, Medicine Ltd., London, Uk, 1976.
content of the sounds S1 and S2 too much. [11] Y. Meyer, Ondelettes et Opérateurs. Tome1, Hermann, Paris, 1990.
[12] S. Mallat, A theory for multiresolution signals decomposition: the
The two versions of analysis of WT (DWT and PWT) make wavelet representation, IEEE Trans. Pattern Anal. Mach. Intell. 11 (1989)
it possible to gather time-frequency information concerning the 674–693.
characteristics of cardiac sounds. [13] P.I.J. Keeton, F.S. Schlindwein, Application of Wavelets in Doppler
The studied techniques (FT, STFT, WD, CWT, DWT and Ultrasound, Emerald Sensor Review, vol. 17(1), MCB University Press,
PWT) of analysis can thus be regarded as complementary in the 1997, pp. 38–45.
[14] S.M. Debbal, F. Bereksi-Reguig, The fast Fourier and the wavelet
TF analysis of the PCG signal, each relating to a part distinct transforms analysis of the cardiacs sound, Phys. Chem. News (PCN) 15
from the analysis in question. (2004) 54–59.
It is shown that these techniques provide more information [15] S.M. Debbal, F. Bereksi-Reguig, The fast Fourier transform and the
on PCG signals, which will help physicians obtain qualitative continuous wavelet transform analysis of the phonocardiogram signal,
and quantitative measurements of the time and the TF PCG J. Mech. Med. Biol. 4 (3) (2004) 257–272 (ISSN: 0219-5194).
[16] P.M. Bentley, A. Grant, J.T.E. Mc Donnel, Time–frequency and time-
signal characteristics and consequently aid to diagnosis. scale for the classification of native and bioprosthetic valve sounds,
IEEE Trans. Biomed. Eng. 45 (1) (1998) 125–128.
References [17] A. Djebbari, Synthèse des méthodes d’analyse temporelle, spectrale et
spectro-temporelle du signal phonocardiogramme, Electronic Magister
[1] R.M. Rangayyan, R.J. Lehner, A review, CRC Crit. Rev. Biomed. Eng. Thesis of Signals and Systems, Departement of Electronics, Faculty of
15 (3) (1988) 211–236. Science Engineering, University Aboubekr Belkaid Tlemcen, Algeria,
[2] R.M. Rangayyan, R.J. Lehner, Phonocardiogram signal analysis: a 1999, pp. 18–24.
review, CRC Crit. Rev. Biomed. Eng. 15 (3) (1988) 211–236.
[3] L.P. Feigen, Physical characteristics of sound and hearing, Am. J. Cardiol.
28 (2) (1971) 130–133.
S.M. Debbal received the engineering degree in Electronics from the Uni-
[4] F.B. Tuteur, Wavelet transforms in signal detection, IEEE ICASSP
versity of Science and Technology, Oran, Algeria, in 1981. He received the
CH2561-9 (1988) 1435–1438. diploma of doctorate in processing signal from Aboubekr Belkaid Univer-
[5] A. Grossmann, R. Holschneider Kronland-Martinet, J. Morlet, Detection sity, Tlemcen, Algeria in 2004. Since 2002, he has been working as research
of abrupt changes in sound signal with the help of the wavelet transform, member, involved in many research and developmental work in co-operation
in: Inverse Problems: An Interdisciplinary Study. Advances in Electronics with medical professionals, at the University Hospital in Tlemcen, Algeria.
and Electron Physics, supplement 19, Academic Press, New York, 1987, His current research interest includes instrumentation, sensors and biomedical
pp. 298–306. signal processing and particularly time-frequency phonocardiogram analysis.
[6] M.S. Obaidat, Phonocardiogram signal analysis: techniques and
performance comparison, J. Med. Eng. Technol. 17 (6) (1993) 221–227.
[7] B. Boashas, Time–frequency signal analysis, in: S. Haykin (Ed.), F. Bereksi-Reguig received the engineering degree in Electronics from the
University of Science and Technology, Oran, Algeria, in 1983 and the MSc
Advances in Spectrum Estimation, Prentice-Hall, Englewood Cliffs, NJ,
and PhD degrees in Modern Electronics from the University of Nottingham,
1993. England, in 1985 and 1989, respectively. Currently, he is a Professor in the
[8] J. William, Williams, Time–frequency and Wavelets in Biomedical Signal Department of Electronics at the University of Tlemcen, Algeria, and the
Processing, in: A. Metin (Ed.), IEEE Press Serie in BME, 1997, pp. 3–8. Director of the research laboratory in Biomedical Engineering. His area of
[9] R. Olivier, P. Duhamel, Fast algorithms for discrete and continuous research interests includes biomedical signal processing and microcomputer-
wavelet transforms, IEEE Trans. Inf. Theory 38 (2) (1992) 569–586. based medical instrumentation.

Computerized Heart Sounds Analysis: S.M. Debbal, F. Bereksi-Reguig

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Computerized Heart Sounds Analysis: S.M. Debbal, F. Bereksi-Reguig

Uploaded by

Copyright:

Available Formats

Computers in Biology and Medicine 38 (2008) 263 – 280

Computerized heart sounds analysis

Received 23 December 2006; accepted 26 September 2007

1. Introduction normal cardiac cycle. Heartbeat sound analysis by auscultation

cross-terms is by smoothing the TF plane, but this will be at the

aortic-stenosis frequency spectrum

aortic-insuficiency frequency spectrum

mitral-stenosis frequency spectrum

-0.2 Systole Diastole

0 0.5 1 1.5 2 2.5 3

-0.2 Systole Diastole

0 0.5 1 1.5 2 2.5 3

0 0.5 1 1.5 2 2.5 3

0 0.5 1 1.5 2 2.5 3

500 to be 13 ms. It is smaller than the 30 ms [17] as foreseen in the

coarctation of the aorta

0 1000 2000 3000 4000 5000 6000 7000 8000

coarctation of the aorta

You might also like