You are on page 1of 4

CLASSIFICATION OF MICROORGANISMS USING IMAGE PROCESSING

TECHNIQUES

Teresa Alvarez, Yolanda Martin, Susana Pe'rez, Francisco Santos,


Fernando Tadeo, Susana Gonzalez
Department of Engineering Science and Automatic Control, University of Valladolid

Jose' Luis Arribas


Empresa Depuradora de Aguas Residuales, Avila

Pastora Vega
Department of Computer Science and Automatic Control, University of Salamanca

ABSTRACT - Changes in the quality of the water affect their


numbers and types.
This paper presents an approach to the automatic - There are diatoms adapted to many different
classification of microorganisms based on image- pollution levels.
processing techniques. A computer application that
processes and classifies the images has been developed.
The classification is carried out using a competitive
neural network. Its input pattern is evaluated from a
frequency analysis of images taken through microscope.
This paper focuses on the application of this technique
to the classification of living diatoms.

Figure 1: Diatom
TAX'IBD [6] and OMNIDIA [19] are two
1. INTRODUCTION examples of programs that can be used to calculate the
IBD, but the classification process is perfomed by the
As it is well known, image processing is a mature area, user (choosing among different images the most
which has successfully solved many problems in similar). ADIAC [ 5 ] is a European project that has
different areas, such as Medicine, Astronomy or begun developing tools for automatically classifying
Engineering ([21], [ l l ] , [20] and the references therein). diatoms. Some of the results are presented in [24]. This
With this work, we propose to use these techniques to work is more a quantitative study than a qualitative one
solve a problem in Microbiology: the automatic [24], we focus on identifying and classifying the algae
classification of microorganisms from their images instead of counting the number of them.
taken using light microscopes This paper summarizes our study of image
The growth of some species of microorganisms is processing techniques applied to the automatic analysis
directly related with the temperature, light intensity and and recognition of diatom images. It has been developed
physiochemical characteristics of the environment (pH, a computer application that performs the image
salinity, the presence of organic matter, etc.). processing and the classification.
Environmental changes can be studied using these Next section describes the approaches followed for
microorganisms. the image processing and the generation of reference
In particular, the identification of the type and patterns. Then, we justify the reason for choosing a
amount of diatoms in a sample eases the calculation of competitive neural net. Finally some results and
+the Biological Diatom Index (BDI). The BDI is a conclusions are discussed.
numerical value [ 11 that estimate the water quality based
' on the presence/absence and number of certain species
2. IMAGE ENHANCEMENT
of diatoms. Diatoms or Bacillariophyceae (Figure 1) are
microscopic unicellular algae with silica shell that At this stage, the goal is the enhancement of the images
inhabit all aquatic environments [13]. These of diatoms because there are more microorganisms and
microorganisms are chosen as bioindicators because: impurities in the image. Diatoms can be discriminated
- They live in aquatic environments as different as
very well using their contour. The rest of the body is
fresh, salt and brackish waters, running and stagnant transparent and the levels of intensity correspond with
water, dripping and intermittent waters, etc. those of the background of the image [ 131 (Figure 1).

0-7803-6725-1/011$10.00 02001 IEEE ,329


We have implemented spatial domain methods amplitude pixels and that the intensity is concentrated in
([20], [21]) because we consider they are a good a small interval. The processed image (Figure 5) shows
approach for our problem and they are not very a diatom that stands out the background.
computationally or time demanding. The contrast
stretcing is done block by block. This way, memory 3. DIATOM RECOGNITION
requirements variations in luminosity can be corrected
(from left to right or up-down). Image recognition ([9], [20], [15], [22]) implies the
existence of some characteristics or descriptors that
2.1. 'Block' operation allow concluding which kind of object or situation is
represented in the image (in our case, the diatoms).
The image is divided in areas of equal size (Figure 2, at The ability to perform pattern recognition is
the right hand side, a magnified view of a block can be fundamental to image analysis. In our cases, we work
observed) that will be called 'blocks'. Each region will with quantitative descriptions that have been obtained
be processed independently. Some advantages are: from the frequency analysis of the image. So, the
patterns we consider are vectors of real values.

Figure 2: Block Obtention


- The decision parameters of the algorithm are 10 20 30 40 53 M
selected from the characteristics of each block. So,
there are no side effects from other areas of the Figure 4: Histogram
image that could worsen the result.
- An intensily compensation of the image is done.
- As the me1hod works with subimages, less memory
is needed t13 store the intermediate data generated by
the algorithm.

2.2. Contrast s.tretching


Compensation techniques increase the image contrast
and enhance the non transparent areas of the diatoms. In Figure 5: Processed image
particular, contrast stretching increases the dynamic
range of the gray levels in the image being processed. 3.1. Generating patterns for diatoms images
We have applied a window-level contrast stretching
[20] (Figure 3), such that S = R", S is the final contrast In order to recognize the microorganism that appears in
range, R is the initial contrast range and y i s a positive the image, it will be necessary to analyze certain spectral
parameter called gamma correction. properties or characteristics that distinguish one
output 1 4 corpuscle from the other. We use information based on
the frequency components of the images, generated from
the power spectrum (or spectral density) of the images.
The power spectrum of bidimensional images
corresponds to a real bidimensional matrix where each
component is defined by:
Figure 3: Contrast Stretching
where
We have scaled the input range to the total range, M-I N-l
giving as output an image with the microorganisms
scaled towards the black.
x=oy=o
2.3. Example the Fqurier transform of the digital image f(x,y) and
F(u,v) is its conjugate [17].
Figure 1 shows a diatom before the image processing The image input to the frequency analysis is the image
techniques are applied. The histogram (Figure 4) reveals output of the methods presented in 2.1. The 2-D
that the image contains relatively few low or high

330
spectrum is (diagonal inverse) symmetric even if the these connections and the input vectors belong to the
component at frequency0 has or has not been moved to intervals [0, 11 and [-1, 11. And they are normalized to
the centre of the frequency plane. So, it is only unity.
necessary to consider one of the spectrum quadrants.
Even so, the features vector is too big and the 4.2. LVQ nets and diatom classification
classification cannot be done fast enough. For instance,
one of this microscopic images can generate an intensity There are four values in the pattern, so the input vector
vector of 100 to 400 values. We need to obtain some has four elements. The number of neurons in the
parameters from the spectrum of the images in such a competitive layer (hidden layer) and the number of
way that the features vector decreases its size linear neurons in the output layer are chosen following
dramatically and contains the information necessary to the classification goals. For instance, if there are five
classify the images. So, we decided to use as parameters types of microorganisms, then there are five output
the spectral maximum, the total energy and the neurons. The links between the competitive layer and
bandwidth where the 95% of the intensity lies (in the output layer establish which and how many neurons
horizontal and vertical directions). We found of the first layer belongs to a certain class, the other
experimentally that these values summarizes rather well links are kept out because they represent links with non
the features that discriminate one images from other. adequate classes.
Once the features vector has been calculated, the
classification can be done. 5. COMPUTER APPLICATION

4. MICROORGANISMS CLASSIFICATION As it was summarized in the abstract, we have


developed a computer application (Figure 7) that
4.1. LVQ neural nets processes and classifies images of microorganisms. The
tool recognizes successfully the types of diatoms for
We have chosen neural nets [12], [14] as the which it has been trained.
classification tool because they have been applied The user interface has been programmed in C++.
successfully to many different problems in pattern The neural net was trained using the Neural Nets
recognition. Moreover, there are promising results in the toolbox of Matlab. We have used Mideva to compile the
analysis of microscopic images in industry: sugar Matlab code into C++.
factories (growth of crystal sugars [lo]), food industry This program performs typical operations such as
[7], .... rotating, selecting an area or zooming the image, among
Some years ago, people that worked on Artificial others. It is possible to write and print informs and many
Vision preferred "feed-forward'' nets (such as more things. But what it is really interesting is that
Perceptron). Then, the self-organized nets [2] spread and images of microorganisms are processed via a simple
the ART nets [I71 were used as visual patterns and not wide used technique (block operation) and
classifiers. During the last years, [2], [3], [23] Learning contrast strectched giving as result images where the
Vector Quantification (LVQ) neural nets (Figure 6) have diatoms are clearly distinguished by their contour.
been used because as it has been reported in the The application has been evaluated with several
literature, they give very good results in image images. The results have been satisfactory: the diatoms
classificationconsidering frequency information. are correctly classified when they belong to the trained
sets or rejected otherwise.
Competitive
layer
6. CONCLUSIONS AND FUTURE WORK

Neural nets have proved to be a valuable tool for


classifying water images using their frequency
Inputs components. It has also been stated the importance of
the preprocessing stage prior to the feature extraction.
Figure 6: LVQ Neural Net Future works considers the inclusion of the
presence/absence of certain colors in the feature pattern.
We have used competitive learnig to train the neural
Moreover, it is now required that the user rotates the
net. In competitive learning only a single output neuron
image until the main axis of the microorganism lies
is active at any one time. This method of learning
horizontally. This is cumbersome. A possible approach
divides the input set in groups or classes according to
is to detect automatically the direction using the Hough
their properties.
transform. Other solution would be to substitute the
Our net uses a non-supervised algorithm in the
feature pattern with a pattern generated using the radial
hidden layer (so, the classes can be grouped) and a
frequency vectors (that are independent of the direction)
supervised algorithm in the output layer (so the user sees
and the variance along the points at the same distance of
at which class belongs the image).
the origin. Other hture work could be the classification
In a LVQ neural net, there connections among all
of this type of images considering frequency and
the output units and all the input units. The weights of
morphologic properties.

33 1
Figure 7 : Computer application
8. ACKNOWLEDGMENTS [ 121, Haykin. Neural Networks. A comprehensive
Foundation. MacMillan College. Publishing Company,
This work has been funded by the Junta de Castilla y 1994.
Leon, project refe:rence C01/200F. A special thanks to
[13] Krammer, K., Lange-Bertalot H.,
Prof CCsar de Prada (University of Valladolid, Spain)
Bacillariophyceae I., 2., 3. and 4. G Fischer Verlag,
for his knowledge and encouragement.
Stuttgart, 1986-91
9. REFERENCES [I41 Krose, B. J. A., P.P. van der Smagt. “An
introduction to Neural Networks”. University of
[ 13 AFNOR, Norme fianqaise expCnmentale, Amsterdam. 1993.
“DCtermination cle I’Indice Biologique DiatomCes
[15], Kulkarni, A. D., “Artificial Neural Networks for
(IBD), 2000.
Image Understanding”. International Thomson
[2] Ahalt, S. C., A. K. Krishnamurthy, P. Chen y D. E. Publishing, 1994.
Melton. “Competitive Learning Algorithms for Vector
[I61 Tadeo, F., D., Matia and T. Alvarez, “Detection of
Quantization”. Neural Networks. Vol. 3. 1990
phases in sugar crystallization using wavelets”,
[3] Bezdek, J.C., N.R. Pal. 1995. “Two Soft Relatives submitted to ICIP’2001.
of Learning Vectlx Quantization”. Neural Networks.
[17] Maren, A., C. Harston y R. Pap. Handbook of
Vol 8, No 5, 1995.
Neural Computing Applications, Academic Press Inc.,
[4] Boyle, R.D., Thomas R. C., Computer Vision: A 1990.
First Course, I3lackwell Scientific Publications,
[18] Martin, Y. and S. Perez, Sistema de
Oxford, 1988.
procesamiento de imagenes pura una planta
[5] Buf, H. du, M. Bayer, S. Droop, R. Head, S. depuradora de uguus residuules, Final Degree Projet,
Juggins, S. Fischer, H. Bunke, M. Wilkinson, J. July 2000.
Roerdink, J. E’ech-Pacheco, G. Cristobal, H.
[19] Plaza A., Cemadas E., Durin M.L., Sinchez J.M.,
Shahbazkia, and A. Ciobanu. “Diatom identification: A
Peton M.J., Detection Automitica de Vetas de Grasa
double challenge called ADIAC”, Proceedings
en Jamon Iberico. Jornadas de Automitica, Salamanca,
ICIAP’99, Venice. pp. 734-739, 1999.
Septiembre 1999
[6] Caron, D., “TAX’IBD“, Institut SupeCrieur
[20] Pratt W. K., Digital Image Processing, John
d’Agriculture de Lille, Abril 2000
Wiley & Sons, New York, 1991.
[7] CLCI, OMNIDIA Version 3, http://uerso.club-
[21] Russ, J. C., The image processing Handbook.
internet.fr/clci, 1999
Second Edition. CRC Press, 1994.
[SI Demuth, H., M. Beale. Neural Network Toolbox
[22] Russell, S., P. Norvig. Artificial Intelligence. A
for Use With Matjab. User’s guide. Version 3.0. Junio
Modern Approach, Prentice Hall, 1995.
1992.
[23] Song, H., S. Lee. “LVQ Combined with
[9] Gonzalez R.C., Digital Image Processing,
Simulated Annealing for Optimal Design og Large-Set
Addison-Wesley, 1992.
Reference Models”. Neural Networks, Vol. 9, No 2.
[IO] Gonzalez, S. E., “Sensor Inteligente para el
[24] Wilkinson, M.H.F., J.B.T.M., Roerdink, S., Droop
Proceso de Cristalizacion del Anicar”. Ingenieria de
and M., Bayer, “Diatom Contour Analysis using
Sistema y Automatica. Tesis Doctoral, Univ. de
Morphological Curvature Scale Spaces”, ICPMOOO,
Valladolid. Abril 1988.
Barcelona, pp. 656-659,2000.
[ 1I ] Harlick, R. and L., Shapiro, Computer and Robot
Vision, 1992

332

You might also like