net/publication/335347386
All content following this page was uploaded by Valérie Louis-Dorr on 24 August 2020.
ARTICLE INFO

Keywords: Intestinal parasites; Wavelet edge detector; Hough transform; Active contours; Probabilistic neural network

ABSTRACT

Human parasites are a real public health problem in tropical countries, especially in underdeveloped countries. Usually, the medical diagnosis of intestinal parasites is carried out in the laboratory by visual analysis of stool samples using the optical microscope. The parasite is recognized by comparing its shape with known forms. We offer a solution to automate the diagnosis of intestinal parasites through their images obtained from a microscope connected directly to a computer. Our approach exploits contour detection based on the multi-scale wavelet transform for detecting the parasite. Active contours are combined with the Hough transform to perform image segmentation and extraction of the parasite. We used principal component analysis for the extraction and reduction of features obtained directly from the pixels of the extracted parasite image. Our classification tool is based on the probabilistic neural network. The resulting algorithms were tested on 900 samples of microscopic images of 15 different species of intestinal parasites. The results show a 100% recognition rate.
https://doi.org/10.1016/j.imu.2019.100238

1. Introduction

A parasite is an organism that lives at the expense of its host, which is also an organism. Intestinal parasites are a form of human parasitosis. Approximately four billion people are affected in the world [1]. Intestinal parasites are responsible for physical or behavioral disturbances in children and in immuno-deficient persons and, in the worst cases, can cause death. The diagnosis of intestinal parasites is carried out in the laboratory by observing stool samples through an optical microscope. The parasite is identified by comparing the observed shape with known forms. This practice is time-consuming and laborious. In addition, this clinical test is slow and prone to many diagnostic errors. There is no reliable quality control. The aim of our research in this article is to contribute to solving these problems.

In the literature, we found some studies devoted to the medical diagnosis of intestinal parasites based upon microscopic image analysis. Parasitic organisms have, at certain stages of their development, well-known morphologies. They therefore lend themselves to pattern recognition techniques. The various approaches in the literature can be distinguished either by the parasite species involved in classification, or by the type of characteristics used by the classifier. In Ref. [2], the authors were interested in identifying human helminth eggs through artificial neural networks (ANN). Widmer et al. [3] focused on the recognition of Giardia cysts and Cryptosporidium oocysts using an artificial neural network and immunofluorescence microscopy. Using Bayesian classification, Castanon et al. [4] identified seven species of Eimeria. Ginoris et al. [5,6] used an artificial neural network to recognize metazoa and protozoa that are commonly found in the mud. Dogantekin et al. [7] and Avci et al. [8] used support vector machines (SVM) and a fuzzy inference system based on an adaptive network to recognize helminth eggs. In Ref. [9], the authors proposed a diagnostic method for roundworm and whipworm eggs using the parameters of shape, roundness and dimension. Their classifier is based on a filtering system with determination of stable thresholds. However, these studies do not address the identification of human intestinal protozoa, and the parasite segmentation step is manual. Suzuki et al. [10] proposed a significant advance towards the automation of parasite diagnosis by image analysis. Their approach is the first to focus on 15 species of protozoa and helminths among the most common in Brazil. The image analysis method used in Ref. [10] has three main steps: the image
segmentation, which locates and delimits the objects present in the image; the description of the forms, which extracts the features of the segmented object; and the classification, which uses these features to recognize the species of parasites. These features include the parameters of roundness, geodesic distances, curvature variance, texture, perimeter, and area of the parasite. The classifier used in Ref. [10] is based on the image foresting transform (IFT) method. A performance analysis of algorithms for the identification of intestinal parasites is proposed in Ref. [11]. In Ref. [12], the authors proposed a system for the identification and quantification of pathogenic helminth eggs using segmentation by the watershed method. Their classification system uses the nearest neighbor algorithm. This analysis remains limited to the eggs of two species of helminths (roundworms and whipworms).

Saha et al. proposed in Ref. [13] the first, partial solution devoted to the recognition of 9 species of amoebic cysts. In this paper, a complete system for automated medical diagnosis of human intestinal parasites is proposed. As in Ref. [13], our method is based on the processing of microscopic images and an artificial neural network system. The steps of our approach are edge detection, image segmentation and pattern recognition. One of the main novelties of our method over the existing techniques is that the step of detection and extraction of parasites in microscopic images is fully automated. Another difference between our method and previous methods of parasite diagnosis is the type of feature descriptor that we used. Our feature descriptor directly uses the pixels of the image and does not need to compute other parameters. Our identification tool uses principal components analysis (PCA) for dimensionality reduction and the probabilistic neural network for classification. In addition, the varieties of intestinal parasites are extended to 15 in this work. Also, the results of the different steps of our system are detailed and the models used are justified here.

In the next section, we present the equipment and methodology used. This covers image acquisition, segmentation and recognition. In Section 3, the experimental results are provided. This is followed by a discussion in Section 4 and the conclusion in Section 5.

B.S. Tchinda et al. Informatics in Medicine Unlocked 16 (2019) 100238

2. Materials and methods

The main objective of our system is to identify and recognize the varieties of human parasites in feces slides. For this purpose, we designed a system whose functioning is given by the block diagram illustrated in Fig. 1. After acquiring microscopic stool images, our system performs image segmentation and extracts the parasites. Classification features are then extracted from the image. These features are input to the neural network. After training and testing, the classifier provides the result of the recognition by displaying the type of parasite detected.

2.1. Acquisition of the microscopic images of stools

The microscope provides a representation of what is not visible to the naked eye. An optical microscope coupled to an acquisition sensor was used in our study. It gives us access to the images of parasites for recognition. Images of stools are captured via the microscope and transferred to a computer for analysis and diagnosis, as shown in Fig. 2. As can be seen in Fig. 2, to acquire stool images, we used glass slides and microscopy slides; an optical microscope with a digital camera eyepiece and PC plug ("TRAVELER MICROSCOPE"), with its illuminated pen, PC installation drivers and a video capture program; and a laptop with an Intel(R) Pentium(R) M 2.26 GHz processor, 504 MB RAM, Microsoft Windows XP SP1, a 55 GB hard disk with more than 5 GB free, and a HI color display screen (1024 × 768 pixels). Note that the characteristics of our computer are much higher than the minimum required for this microscope, namely: Pentium III 800 MHz or more, recommended for digital video (DV); 256 MB RAM or more; Microsoft Windows 98 SE, Windows Me, Windows 2000 or Windows XP; at least 500 MB of available hard disk space (4 GB is recommended); a True Color or HI color display (1024 × 768 pixels).

The microscopic images of stools that we used were obtained in a parasitology laboratory of the Public Regional Hospital and in a private clinical laboratory, "Clab-Labo", both in the town of Bafoussam, Cameroon. More images of parasitized stools were obtained from databases available online [14,15].

2.2. Segmentation and extraction of the parasite

Generally, a microscopic image of stools contains many items that are unnecessary for diagnosis. In addition, there are many parasites in a single image. Therefore, before its recognition, each parasite must be individually extracted. The parasite extraction is done through the process of segmentation. Segmentation techniques are either region-based or contour-based methods. Region-based methods use the intrinsic properties of the objects that are to be extracted. These methods depend heavily on the image characteristics and the shape to be extracted. Contour-based methods seek the contours of the objects to be extracted by using the discontinuity of the image intensity. The edge detection processes either the maxima of the gradient or the zero crossings of the Laplacian of the image intensity function. In Ref. [16], Tchiotsop et al. have shown that edge detection using the multi-scale wavelet produces better results than other conventional edge detectors, particularly when it is applied to microscopic images of feces. The main advantage of the multi-scale wavelet
transform in edge detection is the ability to choose the size of the details that will be detected. Nevertheless, the outlines obtained are often split. The Hough transform can extract parametric shapes in an image. It is used in computer vision problems such as the detection of lines, circles, ellipses or other curves. The Hough transform is successfully used on ultrasound images by Golemati et al. [17] to highlight transverse and longitudinal sections of the carotid artery. The form of certain intestinal parasites such as amoebic cysts is circular. Thus, a circular model of the Hough transform can be easily used for their detection and extraction. However, other, irregular shapes will not be completely located. The active contours technique, also called the snake model, is another segmentation approach. It is very effective in contour detection. An implementation example of this method is presented in Ref. [18]. The high dependence of active contours on the initial contour is a major disadvantage. When the initial outline is near the target contour, the active contours algorithm converges quickly. The combination of the snake model with the Hough transform is feasible. The Hough transform allows one to automatically locate the parasite region of interest. This computational result is then considered as the initial outline for the active contours. In Ref. [19], this method is successfully utilized in the extraction of human parasites from microscopic images of stools. Segmentation uses the outline map of the image to separate the parasite from its background. The image background of the parasite is then deleted using a logic operation. Thus, our segmentation algorithm uses the active contours combined with the Hough transform. The multi-scale wavelet transform is used to detect the edges of the parasites.

2.2.1. Principle of the edge detection based on the multi-scale wavelet transform

It is shown in Ref. [20] that the wavelet transform $W_a f(x,y)$ of an image $f(x,y)$ is proportional to the first derivative smoothed by a convolution kernel $\theta(x,y)$ at a given scale $a$. This proportional relationship is given by the following equations:

$$W_a f(x,y) = \begin{pmatrix} W_a^1 f(x,y) \\ W_a^2 f(x,y) \end{pmatrix} = a \begin{pmatrix} \dfrac{\partial}{\partial x}(f * \theta_a)(x,y) \\ \dfrac{\partial}{\partial y}(f * \theta_a)(x,y) \end{pmatrix} = a\,\nabla (f * \theta_a)(x,y) \tag{1}$$

with $\theta_a(x,y) = \frac{1}{a}\,\theta\!\left(\frac{x}{a}, \frac{y}{a}\right)$.

To locate the edges in an image, the modulus of the wavelet transform is first computed. Its local maxima are then determined in the direction of the gradient $\nabla (f * \theta_a)(x,y)$. The modulus-angle representation of the wavelet transform, at each scale $a$ ($a = 2^j$, $j \in \mathbb{N}$), is defined in Refs. [16,20] by the following equation:

$$\begin{cases} M_{2^j} f(x,y) = \sqrt{\left|W_{2^j}^1 f(x,y)\right|^2 + \left|W_{2^j}^2 f(x,y)\right|^2} \\[2ex] A_{2^j} f(x,y) = \tan^{-1}\!\left(\dfrac{W_{2^j}^2 f(x,y)}{W_{2^j}^1 f(x,y)}\right) \end{cases} \tag{2}$$

$M_{2^j} f(x,y)$ is the modulus of the wavelet transform. $A_{2^j} f(x,y)$ is the angle between the gradient vector $\nabla (f * \theta_{2^j})(x,y)$ and the X-axis of the image plane $(x, y)$. For the convolution kernel $\theta(x,y)$, we used the Gaussian function, which offers algorithmic advantages [16].

The edge detection based on the multi-scale wavelet transform consists of first detecting, at each scale, the local maxima of the modulus of the wavelet transform. For local maxima detection, a hysteresis threshold is used. This kind of threshold is defined by an interval [TL, TH], with TL = 0.4 × TH. Afterwards, these local maxima are chained from the coarser scales ($a > 1$) to the finest scale ($a = 1$). The goal of the chaining is to solve the localization problems caused by the convolution at the large scales of the analysis. The full version of the contour detector based on the multi-scale wavelet transform can be found in Ref. [16].

2.2.2. Principle of the Hough transform

The Hough transform (HT) is a tool used to determine significant groups of characteristic points that meet some parametric constraints. It is based on the general principle of a parametric constraint of the following form:

$$f(X, A) = 0 \tag{3}$$

where $X = (x_1, x_2, \ldots, x_n)^T$ is a point of the image and $A$ is a vector of parameters.

The constraints can represent curves (lines, circles, etc.), surfaces (planes, cylinders, etc.) or movement trajectories (translation, rotation, etc.), depending on the interpretation of the feature point. In our case, the feature points are edge pixels obtained from the multi-scale wavelet transform. However, they could also be gray level values. We are interested in the detection of circular shapes in the image. In Cartesian coordinates, the equation of a circle is given as follows:

$$(x - a)^2 + (y - b)^2 = r^2 \tag{4}$$

where $(a, b)$ represents the coordinates of the center and $r$ is the radius of the circle.

In the parametric constraint of equation (3), the parameter vector is given by equation (5):

$$A = (a, b, r) \tag{5}$$

Here, $X = (x, y)$ is a point of the image.

For each pixel in the contour, we want to know whether this pixel belongs to a circle. Thus we look for the parameters of that circle. It can be seen from equation (4) that $x$ and $y$ are considered to be fixed, while $a$, $b$ and $r$ vary. The basic method uses a three-dimensional accumulator array $A(a, b, r)$. Each contour element votes for all the circles to which it may belong, and the peak of the accumulator array $A(a, b, r)$ is sought. This peak gives the position of the circle and its radius. If the radius is known in advance, only a 2-D accumulator array is needed. The full version of the Hough transform algorithm can be found in Ref. [19] and references therein.

2.2.3. Contour optimization through the gradient vector flow (GVF) active contours

The method of active contours is based on the construction of an energy functional that measures the relevance of the outline. The Gradient Vector Flow (GVF) model is one of the snake models that was developed in order to increase the capture range and improve the ability of the snake model to move into boundary concavities. The GVF model addresses these problems by introducing a new external force. The Gradient Vector Flow field is defined to minimize the following energy functional [21]:

$$E = \iint \left\{ \mu \left( u_x^2 + u_y^2 + v_x^2 + v_y^2 \right) + |\nabla f|^2 \, |V - \nabla f|^2 \right\} dx\,dy \tag{6}$$

where the vector field is $V(x,y) = (u(x,y), v(x,y))$, $f$ is the edge map of the image, and $\nabla f = (f_x, f_y)$ is the gradient of the image edge map. The regularization parameter $\mu$ controls the relative effect of the two terms. $\mu$ is set according to the amount of noise present in the image (the noisier the image, the larger $\mu$ should be).

In our case, a value of 0.2 was used for $\mu$. When we minimize the energy functional of Equation (6), the following Euler equations are derived [22]:

$$\begin{cases} \mu \nabla^2 u - (u - f_x)\left(f_x^2 + f_y^2\right) = 0 \\ \mu \nabla^2 v - (v - f_y)\left(f_x^2 + f_y^2\right) = 0 \end{cases} \tag{7}$$

Equations (7) are solved by considering $u$ and $v$ as functions of time. We get:
$$V_t = \left( u_t(x,y,t),\; v_t(x,y,t) \right)$$

with:

$$\begin{cases} u_t(x,y,t) = \mu \nabla^2 u(x,y,t) - \left( u(x,y,t) - f_x(x,y) \right) \left( f_x(x,y)^2 + f_y(x,y)^2 \right) \\ v_t(x,y,t) = \mu \nabla^2 v(x,y,t) - \left( v(x,y,t) - f_y(x,y) \right) \left( f_x(x,y)^2 + f_y(x,y)^2 \right) \end{cases} \tag{8}$$

where $f_x$ and $f_y$ denote the derivatives of $f$ with respect to $x$ and $y$, respectively. The gradient vector flow is the solution of the system of equations (8). Since these equations are decoupled, they can be solved separately in $u$ and $v$, as scalar partial differential equations.

After calculating the Gradient Vector Flow, the GVF snake is defined in Ref. [21] as the parametric curve ψ satisfying the following dynamic equation:

$$X_t(s,t) = \alpha X'' - \beta X'''' + k\,V(X) \tag{9}$$

Here, $X$ is a function of time $t$ and space $s$. $\beta$ is the rigidity parameter and $\alpha$ is the tension parameter of the snake. The parameter $k$ controls the extent to which the GVF field affects the deformation of the curve. $X''$ and $X''''$ designate the second and fourth derivatives of $X$ with respect to $s$, respectively. After its discretization, the dynamic equation (9) is solved by iteration. Starting from the initial curve, the iterative equations for the deformation of the snake are given in Ref. [21] by:

$$X_t = (x_t, y_t): \quad \begin{cases} x_t = (A + \gamma I)^{-1} \left( \gamma\, x_{t-1} + k\, u(x_{t-1}, y_{t-1}) \right) \\ y_t = (A + \gamma I)^{-1} \left( \gamma\, y_{t-1} + k\, v(x_{t-1}, y_{t-1}) \right) \end{cases} \tag{10}$$

where $x$ and $y$ denote the coordinate vectors of the points on the curve, $\gamma = 1/\Delta t$ is the step size of the iteration, $I$ is the identity matrix and $A$ is a pentadiagonal matrix with boundary conditions established for the snake used for longitudinal images. The snake is dynamically reparameterized after each iteration to keep the spacing between contour points within 0.5–1.5 pixels [22].

The contour search procedure of the snake method depends on other mechanisms, such as interaction with a user or a high-level computer vision mechanism. First, the snake is placed near the contour of the region of interest. To achieve this first step, we used the Hough transform to automatically find the initial curve of the snake. The second step is the deformation of the initial curve (the circle given by the HT). This initial curve is considered as a snake that uses the GVF field as the external force. Details of the extraction algorithm are given in Ref. [19].

2.3. The feature extraction using principal components analysis (PCA)

Principal components analysis (PCA) is a conventional linear method of feature extraction. It is based on the second-order statistical analysis of the data, and in particular on the analysis of the eigenvalues of the covariance matrix. The basic idea is that, in many such measurements, the $m$ observational variables $(x_1, x_2, \ldots, x_m)$ can be well represented by a parametric surface of dimension $n$, $(y_1, y_2, \ldots, y_n)$, with $n$ smaller than $m$.

PCA has been used in the dimensionality reduction of multivariable data [23–25]. It consists of projecting the data onto the directions of their maximum variability. Thus, a family of variables is replaced by new variables of maximum variance, which are uncorrelated and are linear combinations of the original variables. The principal components are basis vectors of the directions in descending order of variability. The basis vector for the direction of the highest variability is provided by the first principal component. The second principal component gives the basis vector for the next direction orthogonal to the first principal component, and so on. The calculation of the principal components requires computation of the covariance matrix and calculation of its eigenvalues, with the eigenvectors stored in descending order of eigenvalues. For the reduction of features, only the first $n$ eigenvectors (with $n < m$) are selected, corresponding to the largest eigenvalues. The projection of the data onto the new basis defined by the principal components is carried out by the scalar product of the original data with the arranged eigenvectors.

In practice, PCA consists of looking for a transformation matrix $W$ that maps each characteristic vector $C_X$ defined in the set $X$ to another vector of characteristics $C_Y$ for the set $Y$, such that the covariance matrix of the elements in $Y$ is diagonal. This transformation is linear and is defined as follows:

$$C_Y = C_X W^T \tag{11}$$

The matrix $W$ can be found by solving the following equation:

$$\Sigma_X w_i = \lambda_i w_i \tag{12}$$

where $\lambda_i$ are the eigenvalues, $w_i$ are the eigenvectors and $\Sigma_X$ is the covariance matrix of $X$.

Once the eigenvectors $w_i$ are known, the transformation matrix $W$ is obtained by taking the $w_i$ as its columns. For the reduction of the characteristics, $W$ is a matrix of dimension $m \times n$, containing the $n$ eigenvectors ($w_i$; $i = 1, \ldots, n$) that correspond to the $n$ largest eigenvalues. In our case, $m = 144$ and $n = 2$.

2.4. Principle of the recognition and classification using the probabilistic neural network

Artificial neural networks (ANN) are very popular tools for pattern recognition. Probabilistic neural networks (PNN) are a special type of radial basis neural network. They are an alternative to radial basis networks for classification problems. Ease of learning and near-instantaneous training are the main advantages of PNN [26–29].

In our work, we used a probabilistic neural network. Its architecture is given in Fig. 3. At the input of the network, there is a column vector P of R rows. The radial inner layer subtracts the input vector from the weight vector W of this layer, and we obtain W–P. The resulting vector is then multiplied element by element with the bias vector b. The obtained output S1 is used as the argument of the radial function "radbas"; thus we obtain a. The activation function in the radial inner layer is defined by $\mathrm{radbas}(n) = \exp(-n^2)$. The output a is multiplied by the weight matrix of the competitive layer LW, and S2 is obtained. The output S2 serves as the argument of the competitive function C. The competitive layer assigns each input to one of the K classes used in the learning step. The output S of the competitive function is a column vector of K rows. The competitive function C produces a 1 for the largest component of S2, and 0 otherwise.

For training and testing the probabilistic network, the reduced characteristics are applied to the input. In our case, a total of 1800 microscopic images of parasites are used. These samples were divided into two sets of 900 images each. The first set, constructed from 60 images for each of the 15 types of parasites, was used for training. The second set of 900 images was used for testing, with the same allocation by type of parasite. The weight matrix W of the radial inner layer is a Q × R matrix formed from the training samples. Each row of W contains the R principal components of a training sample. 900 samples are available for training in this work, thus Q = 900. The radial layer biases are all set to $\sqrt{-\ln 0.5}/s$, where $s$ denotes the network spread constant. In our case, the spread constant has been adjusted to 0.4 by experiment. This is the value that yielded an improved classification rate. The weight matrix of the competitive layer LW is set to a K × Q matrix of Q target class vectors. K is equal to 15 in our case.

3. Results

For our experimentation, we used microscopic images with a magnification of ×400 for helminth eggs and ×1000 for amoeba cysts. The image format is JPEG and the size is 360 × 400.

An example of a microscopic image of feces is shown in Fig. 4. This
image contains the parasitic elements and many other insignificant items. In this image, there is a round tapeworm egg. This egg is about 30 μm in size. A second egg, of ovoid form, is from the whipworm and has a size of about 50 μm. One can also note the presence of food debris. Our goal is to detect the parasite, then extract and recognize it. Before the recognition phase, we need another important intermediate step, which is feature reduction. Its role is to facilitate the recognition step by reducing the number of inputs to the classification system.

3.1. Results of the parasite detection

We applied the proposed edge detector to a large number of microscopic images of feces containing human parasites. Here, the results obtained on some of these images are presented. Fig. 5 shows the results obtained by applying the edge detection algorithm to the microscopic image of stools described above in Fig. 4. We can note in Fig. 5 that varying the analysis scale of the wavelet transform reveals the different structures present in the stool image. For example, as can be seen in Fig. 5, in images (A) and (E), for the two thresholding values used ([0.08, 0.2] and [0.12, 0.3]), the results obtained at scale 1 of the wavelet transform are still confused and contain many unnecessary edges. However, in image panels (C), (D), (G) and (H), the results obtained at scales 6 and 8 with the same thresholds correctly locate almost all desired edges (D). Thus, in this case, scales 6 and 8 are better than scale 1 for the edge detection of parasites in the considered image. Varying the threshold and the analysis scale can therefore reveal the various characteristics of the parasite. Other results of the application of the multi-scale wavelet transform, and its performance on the edge detection of intestinal parasites in microscopic images of stools, can be found in Ref. [16]. These edge detection results show the effectiveness of this algorithm and justify its choice for the parasite detection phase.

3.2. Results of the parasite extraction

We applied our segmentation algorithm based on the Hough transform and active contours to microscopic images of stools containing parasites, in order to detect and extract the parasites. Fig. 6 shows the extraction of various cysts of Entamoeba coli. These cysts can have a spherical or ovoid form, and are about 15–25 μm in diameter, depending upon maturity. Fig. 6 (A) shows an image with 7 cysts. In images (B), (C), (D) and (E) of this figure, the parasites are extracted individually after the adjustment of the analysis parameters (scale, threshold and radius of the circular Hough transform). Fig. 6 (F) shows all four extracted parasites, colored on the same initial image. By varying the analysis parameters, all cysts present in this image can be extracted. In Fig. 7, we can also observe other parasites extracted automatically from an initial outline obtained through the Hough transform. The images (A-B- ... G) are the microscopic images of feces to analyze. The images (A1-B1 …- G1) show the edge images obtained from the multi-scale wavelet process (in black in the figure), on which the initial contour given by the Hough transform is superimposed (blue). They are obtained respectively with the scale, high threshold TH and radius parameters of (4, 0.6, 50 pixels) for A1; (2, 0.5, 40 pixels) for B1; (8, 0.75, 75 pixels) for C1; (6, 0.65, 80 pixels) for D1; (8, 0.4, 35 pixels) for E1; (8, 0.6, 60 pixels) for F1 and (8, 0.6, 35 pixels) for G1. The extracted parasite resulting from the convergence of the active contours is given in the images (A2-B2- … -G2) of the same figure. The analysis radius used by the Hough transform is related to the size of the parasite. It is expressed in pixels. It also depends on the resolution of the camera used and the magnification of the microscope. Thus, a specific type of parasite can be searched for automatically using its size.
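
The circular Hough voting of Section 2.2.2, which supplies the initial contour used in these extraction results, can be illustrated with a minimal Python sketch. It assumes that the radius is known in advance, so that a 2-D accumulator over the center coordinates (a, b) suffices, and that the edge pixels are available as integer (x, y) pairs. The function name `hough_circle_center`, the angular sampling `n_angles` and the synthetic test circle are illustrative assumptions; they are not taken from the implementation of Ref. [19].

```python
import math
from collections import Counter


def hough_circle_center(edge_points, radius, n_angles=360):
    """Return ((a, b), votes) for the most voted circle center.

    Each edge pixel (x, y) votes for every center (a, b) satisfying
    (x - a)^2 + (y - b)^2 = r^2, i.e. Eq. (4) with r fixed.
    """
    accumulator = Counter()
    for x, y in edge_points:
        candidates = set()  # at most one vote per cell and edge pixel
        for k in range(n_angles):
            angle = 2.0 * math.pi * k / n_angles
            a = round(x - radius * math.cos(angle))
            b = round(y - radius * math.sin(angle))
            candidates.add((a, b))
        accumulator.update(candidates)
    # The peak of the accumulator gives the circle position.
    return accumulator.most_common(1)[0]


# Synthetic edge map (assumption): pixels on a circle of radius 10
# centered at (30, 25), sampled every 5 degrees and rounded to the grid.
true_center, true_radius = (30, 25), 10
edge_points = sorted({
    (round(true_center[0] + true_radius * math.cos(2.0 * math.pi * k / 72)),
     round(true_center[1] + true_radius * math.sin(2.0 * math.pi * k / 72)))
    for k in range(72)
})

center, votes = hough_circle_center(edge_points, true_radius)
print(center, votes)
```

In this sketch the radius plays the role of the analysis radius quoted above (for example 50 pixels for A1). When the radius is unknown, the same voting would be repeated over a range of radii, which corresponds to the three-dimensional accumulator A(a, b, r) of Section 2.2.2.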
Fig. 5. Analysis scale variation and edge detection by the wavelet transform. (A) Contours analysis at scale 1 with a thresholding of [0.08, 0.2]; (B) Contours at scale
4 with a threshold of [0.08, 0.2]; (C) Contours at scale 6 with a thresholding of [0.08, 0.2]; (D) Contours at scale 8 with a threshold of [0.08, 0.2]; (E) Contours at
scale 1 with a threshold of [0.12, 0.3]; (F) Contours at scale 4 with a thresholding of [0.12, 0.3]; (G) Contours at scale 6 with a threshold of [0.12, 0.3]; (H) Contours
at scale 8 with a threshold of [0.12, 0.3].
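
The threshold pairs listed in this caption follow the rule TL = 0.4 × TH of Section 2.2.1 (0.4 × 0.2 = 0.08 and 0.4 × 0.3 = 0.12). The following minimal Python sketch illustrates hysteresis thresholding on a modulus map. It assumes 8-connectivity and a plain nested-list image, and it omits the local-maxima detection and the scale-to-scale chaining of the full detector in Ref. [16]. The function name `hysteresis` and the toy modulus values are illustrative assumptions.

```python
from collections import deque


def hysteresis(modulus, th, ratio=0.4):
    """Keep pixels >= th, plus pixels >= ratio * th connected to them."""
    tl = ratio * th  # low threshold TL = 0.4 * TH
    rows, cols = len(modulus), len(modulus[0])
    edge = [[False] * cols for _ in range(rows)]
    queue = deque()
    # Strong pixels (modulus >= TH) are the seeds.
    for r in range(rows):
        for c in range(cols):
            if modulus[r][c] >= th:
                edge[r][c] = True
                queue.append((r, c))
    # Grow the seeds through weak pixels (modulus >= TL), 8-connectivity.
    while queue:
        r, c = queue.popleft()
        for dr in (-1, 0, 1):
            for dc in (-1, 0, 1):
                rr, cc = r + dr, c + dc
                if (0 <= rr < rows and 0 <= cc < cols
                        and not edge[rr][cc] and modulus[rr][cc] >= tl):
                    edge[rr][cc] = True
                    queue.append((rr, cc))
    return edge


# Toy modulus map (assumption): one strong pixel with a weak neighbor,
# and one isolated weak pixel in the bottom-right corner.
toy = [
    [0.25, 0.10, 0.02, 0.00],
    [0.00, 0.00, 0.00, 0.00],
    [0.00, 0.00, 0.00, 0.10],
]
edges = hysteresis(toy, th=0.2)
print(edges)
```

The weak pixel next to the strong one is kept, while the isolated weak pixel is rejected. Raising TH from 0.2 to 0.3, as between panels (A) to (D) and (E) to (H), would remove weaker contours in the same way.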
Fig. 7. Initial contours and parasites extracted. (A-B ... F) are microscopic images of feces for analysis. (A1-B1 … F1) are the outlines obtained from the multi-scale wavelet transform, which are superimposed on the initial contours given by the Hough transform. (A2-B2 … F2) are the images of the parasites extracted.

data. For this goal, the dimension of the space of the new features is selected according to the classification accuracy. It also depends upon the system complexity.

Fig. 8 shows a matrix containing a set of 300 images of parasites. We can distinguish 15 species of parasites. For each species, there are 20 samples. These samples are used to configure our system. Table 1 shows the result of applying the PCA to this set of parasites, namely the first three principal components (PCA1, PCA2 and PCA3). The values in this table are the average and the maximum deviation obtained on the 20 samples of each class (A, B, C, D, E, F, G, H, I, J, K, L, M, N, O). To facilitate classification, a good feature reduction should provide a set of new features with close values within the same class, and dispersed values for different classes. To evaluate our feature reduction tool, independent component analysis (ICA) was also applied to the same image samples. The ICA is a non-linear method of feature reduction [30]. Table 2 shows the results for ICA. Here too, 3 components are retained (ICA1, ICA2 and ICA3). When analyzing these tables, we can see the wide disparity of values for the same species of parasite when it is based upon the ICA method (Table 2). For the PCA, the values of the same class are fairly close together (Table 1). This observation can also be seen in Figs. 9 and 10. In Fig. 9, which provides the values of the first two components for all 300 samples of the 15 classes for the ICA, no parasite class is grouped together. The dispersion rate is very high. For the PCA,
Table 2
Average values of the first three components of the ICA obtained for 15 classes of
20 samples of parasites. Different mean values are added to the maximum
dispersion of these values compared to the average. In this case, we can see that
these differences are high compared to the value considered. Similarly, there are
very small differences or overlaps between the different average values. This
complicates the classification phase.
Fig. 9. ICA projection in two dimensions. Each point in this figure represents the two features obtained for one parasite image. For images of the same class, the points are widely dispersed. In this case, the discriminant function of the classification network is difficult to obtain.
Fig. 10. PCA projection in two dimensions. Each point in this figure represents the two features obtained for one parasite image. For images of the same class, the points are grouped together and separated from those of other classes. In this case, the discriminant function of the classification network is easy to obtain.
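The criterion behind Tables 1 and 2 and Figs. 9 and 10 (close values within a class, dispersed values between classes) can be checked numerically. The following is a minimal sketch under stated assumptions, not the authors' code: the function names `class_stats` and `overlap_pairs` are hypothetical, and the toy feature values are invented for illustration only.

```python
from itertools import combinations


def class_stats(samples_by_class):
    """Return {label: (mean, max_deviation)} for one feature component.

    samples_by_class maps a class label to a list of feature values,
    for example the first retained component of every image in that class.
    """
    stats = {}
    for label, values in samples_by_class.items():
        mean = sum(values) / len(values)
        max_dev = max(abs(v - mean) for v in values)
        stats[label] = (mean, max_dev)
    return stats


def overlap_pairs(stats):
    """List class pairs whose [mean - max_dev, mean + max_dev] ranges overlap."""
    pairs = []
    for (a, (ma, da)), (b, (mb, db)) in combinations(sorted(stats.items()), 2):
        if abs(ma - mb) <= da + db:
            pairs.append((a, b))
    return pairs


# Toy values (hypothetical, not taken from the paper).
compact = {"A": [1.0, 1.1, 0.9], "B": [5.0, 5.2, 4.8]}
spread = {"A": [1.0, 4.0, -2.0], "B": [2.0, 6.0, -1.0]}

print(overlap_pairs(class_stats(compact)))  # well-separated classes
print(overlap_pairs(class_stats(spread)))   # dispersed classes that overlap
```

A feature set behaving like `compact` corresponds to the situation described for the PCA, while one behaving like `spread` corresponds to the situation described for the ICA. The overlap test on a single component is only one simple indicator among many possible ones.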
the choice which contributes to the best diagnostic. The particularity of microscopic images of stools is that they contain a considerable amount of noise and many other elements that are not needed for parasite detection. For the first step, we need to detect the edge of the parasite. We used the multi-scale wavelet transform. In Ref. [16], it is shown that the multi-scale wavelet transform performs well for edge detection of intestinal parasites. The main advantage of this edge detection method is the scale of the analysis. As can be seen in Fig. 5, the image is analyzed on many scales, and the retained result reveals the contours of the intestinal parasite and rejects the contours of undesirable elements. The second step uses the
5. Conclusions
Table 3
Confusion matrix of the classification system, with 60 parasite samples per class. The test gives a 100% correct recognition rate (60/60) for each of the 15 parasite classes considered. Rows are the current classes and columns are the target classes.
Classes   A   B   C   D   E   F   G   H   I   J   K   L   M   N   O
A        60   0   0   0   0   0   0   0   0   0   0   0   0   0   0
B         0  60   0   0   0   0   0   0   0   0   0   0   0   0   0
C         0   0  60   0   0   0   0   0   0   0   0   0   0   0   0
D         0   0   0  60   0   0   0   0   0   0   0   0   0   0   0
E         0   0   0   0  60   0   0   0   0   0   0   0   0   0   0
F         0   0   0   0   0  60   0   0   0   0   0   0   0   0   0
G         0   0   0   0   0   0  60   0   0   0   0   0   0   0   0
H         0   0   0   0   0   0   0  60   0   0   0   0   0   0   0
I         0   0   0   0   0   0   0   0  60   0   0   0   0   0   0
J         0   0   0   0   0   0   0   0   0  60   0   0   0   0   0
K         0   0   0   0   0   0   0   0   0   0  60   0   0   0   0
L         0   0   0   0   0   0   0   0   0   0   0  60   0   0   0
M         0   0   0   0   0   0   0   0   0   0   0   0  60   0   0
N         0   0   0   0   0   0   0   0   0   0   0   0   0  60   0
O         0   0   0   0   0   0   0   0   0   0   0   0   0   0  60
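
The recognition rate reported for Table 3 follows directly from the confusion matrix: it is the sum of the diagonal divided by the total number of samples. The short sketch below illustrates that arithmetic. The function name `recognition_rates` is hypothetical, and the convention that each row holds the samples of one class is an assumption (for a purely diagonal matrix such as Table 3, the orientation does not change the result).

```python
def recognition_rates(confusion):
    """Return (overall_rate, per_class_rates) for a square confusion matrix.

    Assumption: confusion[i][j] counts the samples of class i that were
    assigned to class j, so correct decisions lie on the diagonal.
    """
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    per_class = [row[i] / sum(row) for i, row in enumerate(confusion)]
    return correct / total, per_class


# Rebuild the matrix of Table 3: 15 classes, 60 samples each, all on the diagonal.
n_classes, samples_per_class = 15, 60
table3 = [
    [samples_per_class if i == j else 0 for j in range(n_classes)]
    for i in range(n_classes)
]

overall, per_class = recognition_rates(table3)
print(overall)         # 900 correct out of 900 samples
print(min(per_class))  # lowest per-class rate
```

With any off-diagonal entry, the overall rate drops below 1 and the per-class list shows which class is affected.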
of a medical system for the automatic diagnosis of human intestinal parasites.

Funding and competing interests

There are no conflicts of interest associated with this publication and there has been no significant financial support for this work that could have influenced its outcome.

Ethical approval

This article does not contain any studies with human participants and/or animals performed by any of the authors.

Acknowledgments

Declared none.

References

[1] World Health Organization. "Working to overcome the global impact of neglected tropical diseases", First WHO report on neglected tropical diseases. 2010.
[2] Yang YS, Park DK, Kim HC, Choi M, Chai J. Automatic identification of human helminthes eggs on microscopic fecal specimens using digital image processing and an artificial neural network. IEEE Trans Biomed Eng 2001;48(6):718–30. https://doi.org/10.1109/10.923789.
[3] Widmer KW, Srikumar D, Pillai SD. Use of artificial neural networks to accurately identify cryptosporidium oocyst and giardia cyst images. Appl Environ Microbiol 2005;71(1):80–4. https://doi.org/10.1128/AEM.71.1.80-84.2005.
[4] Castanon CAB, Fraga JS, Fernandez S, Gruber A, Costa LF. Biological shape characterization for automatic image recognition and diagnosis of protozoan parasites of the genus Eimeria. Pattern Recognit 2007;40:1899–910. https://doi.org/10.1016/j.patcog.2006.12.006.
[5] Ginoris YP, Amaral AL, Nicolau A, Coelho MAZ, Ferreira EC. Development of an image analysis procedure for identifying protozoa and metazoa typical of activated sludge system. Water Res 2007;41(12):2581–9. https://doi.org/10.1016/j.watres.2007.02.006.
[6] Ginoris YP, Amaral AL, Nicolau A, Coelho MAZ, Ferreira EC. Recognition of protozoa and metazoa using image analysis tools, discriminant analysis, neural networks and decision trees. Anal Chim Acta 2008;595(1/2):160–9. https://doi.org/10.1016/j.aca.2006.12.055.
[7] Dogantekin E, Yilmaz M, Dogantekin A, Avci E, Sengur A. A robust technique based on invariant moments – ANFIS for recognition of human parasite eggs in microscopic image. Expert Syst Appl 2008;35(3):728–38. https://doi.org/10.1016/j.eswa.2007.07.020.
[8] Avci D, Varol A. An expert diagnosis system for classification of human parasite eggs based on multi-class SVM. Expert Syst Appl 2009;36(1):43–8. https://doi.org/10.1016/j.eswa.2007.09.012.
[9] Kamarul HG, Raafat SH, Zeehaida M. Automated system for diagnosis intestinal parasites by computerized image analysis. Mod Appl Sci 2013;7(5):98–114. https://doi.org/10.5539/mas.v7n5p98.
[10] Suzuki CTN, Gomes JF, Falcao AX, Papa JP, Hoshino-Shimizu S. Automatic segmentation and classification of human intestinal parasites from microscopy images. IEEE Trans Biomed Eng 2013;60(3):803–12. https://doi.org/10.1109/TBME.2012.2187204.
[11] Gomes AP, Noguero L, Bez MR, Tavares RG. Performance analysis of software for identification of intestinal parasites. J Bras Patol Med Lab 2015;51(4):218–23. https://doi.org/10.5935/1676-2444.20150036.
[12] Jiménez B, Maya C, Velásquez G, Torner F, Arambula F, Barrios JA, Velasco M. Identification and quantification of pathogenic helminth eggs using a digital image system. Exp Parasitol Jul 2016;166:164–72. https://doi.org/10.1016/j.exppara.2016.04.016.
[13] Saha Tchinda B, Tchiotsop D, Tchinda R, Wolf D, Noubom M. Automatic recognition of human parasite cysts on microscopic stools images using principal component analysis and probabilistic neural network. Int J Adv Res Artif Intell 2015;4(9):26–33. https://doi.org/10.14569/IJARAI.2015.040906.
[14] Pochet C. Plan d'étude des formes végétatives et kystiques des protozoaires. Available online: http://bioimage.free.fr/par_image/fichesparasito.htm.
[15] Pochet C. Plan d'étude des œufs d'helminthes. Available online: http://bioimage.free.fr/par_image/fichesparasito.htm.
[16] Tchiotsop D, Saha Tchinda B, Tchinda R, Kenné G. Edge detection of intestinal parasites in stool microscopic images using multi-scale wavelet transform. SIViP 2015;9(suppl1):121–34. https://doi.org/10.1007/s11760-014-0716-6.
[17] Golemati S, Stoitsis J, Sifakis E, Balkizas T, Nikita K. Using the Hough transform to segment ultrasound images of longitudinal and transverse sections of the carotid artery. Ultrasound Med Biol 2007;33(12):1918–32. https://doi.org/10.1016/j.ultrasmedbio.2007.05.021.
[18] Stoitsis J, Golemati S, Kendros S, Nikita KS. Automated detection of the carotid artery wall in B-mode ultrasound images using active contours initialized by the Hough transform. Int. Conf. Proc. IEEE Eng. Med. Biol. Soc. 2008:3146–349. https://doi.org/10.1109/IEMBS.2011.6090106.
[19] Saha Tchinda B, Tchiotsop D, Tchinda R, Kenné G. Automated extraction of the intestinal parasite in the microscopic images using active contours and the Hough transform. Curr Med Imag Rev 2015;11(4):233–46. https://doi.org/10.2174/1573405611666150626173209.
[20] Mallat S, Hwang WL. Singularity detection and processing with wavelets. IEEE Trans. Inform. Theory March 1992;38(2):617–43. https://doi.org/10.1109/18.119727.
[21] Xu C, Prince J. Snakes, shapes, and gradient vector flow. IEEE Transactions on Image Processing March 1998;7(3):359–69. https://doi.org/10.1109/83.661186.
[22] Lobregt S, Viergever MA. A discrete dynamic contour model. IEEE Trans. Med. Imag. March 1995;14(1):12–24. https://doi.org/10.1109/42.370398.
[23] Engelbrecht AP. Computational intelligence: an introduction. second ed. Chichester: John Wiley & Sons Ltd, the Atrium, Southern Gate; 2007.
[24] Nixon SM, Aguado AS. Feature extraction & image processing for computer vision. third ed. Academic Press, Elsevier; 2012.
[25] Theodoridis S, Koutroumbas K. An introduction to pattern recognition: a MATLAB approach. fifth ed. Academic Press, Elsevier; 2010.
[26] Kramer C, Mckay B, Belina J. Probabilistic neural network array architecture for ECG classification. IEEE-EMBC and CMBEC, Theme 4: Signal Process 1995:807–8. https://doi.org/10.1109/IEMBS.1995.575373.
[27] Mao KZ, Tan K-C, Ser W. Probabilistic neural-network structure determination for pattern classification. IEEE Transactions on Neural Networks Jul 2000;11(4):1009–16. https://doi.org/10.1109/72.857781.
[28] Wu SG, Bao FS, Xu EY, Wang Y, Chang Y, Xiang Q. A leaf recognition algorithm for plant classification using probabilistic neural network. In: IEEE international symposium on signal processing and information technology; 2007. p. 11–6. https://doi.org/10.1109/ISSPIT.2007.4458016.
[29] Hagan MT, Demuth HB, Beale MH. Neural network design. second ed. eBook: Campus Pub. Service, University of Colorado Bookstore; 2002.
[30] Martis RJ, Acharya UR, ChooMin L. ECG beat classification using PCA, LDA, ICA and Discrete Wavelet Transform. Biomedical Signal Processing and Control September 2013;8(5):437–48. https://doi.org/10.1016/j.bspc.2013.01.005.