Professional Documents
Culture Documents
© 2016 John Wiley & Sons Ltd, Int. Jnl. Lab. Hem. 2016, 38, 209–219 209
210 S. ALFEREZ ET AL. | AUTOMATIC CLASSIFICATION OF LYMPHOID CELLS
© 2016 John Wiley & Sons Ltd, Int. Jnl. Lab. Hem. 2016, 38, 209–219
S. ALFEREZ ET AL. | AUTOMATIC CLASSIFICATION OF LYMPHOID CELLS 211
heterogeneous set. Every time an existing system is proportion of lymphoid cells with cleaved nuclei were
aimed to augment with new cell groups, the training not included in our work. All patients with HCL
process has to be initiated from scratch with larger showed lymphoid cells with phenotype CD11c+,
and varied number of descriptors and the final setup CD25+, FMC7+, CD103+, and CD123+. Patients with
of a new recognition system. MCL had lymphoid cells with phenotype CD5+,
Consequently, this article includes methodological FMC7+, CD43+, CD10 , and BCL6 . In MCL cases,
advances in the cell characterization and recognition. cyclin D1 was overexpressed and t(11;14) was found.
Moreover, it describes a patient-based experimental Leukemic cells from follicular lymphoma were posi-
evaluation: The input to the classification system is a tive for the following B-cell-associated antigens:
set of digital cell images obtained from a PB blood CD19, CD20, CD22, CD79a, BCL2+, BCL6+, CD10+,
smear of an individual patient selected for image anal- CD5 , and CD43 . B-prolymphocyte images were
ysis by the pathologist, based on the typical morphol- selected by their morphology (larger cells with nucle-
ogy of the disease group. The system output is the oli) in patients with CLL. Blood smears were obtained
separation of the image set according to the different from the routine workload of the Core Laboratory of
groups included in the study. The efficiency is ana- Hospital Clinic of Barcelona. Blood was collected into
lyzed focusing on two ways of screening: tubes containing K3EDTA as anticoagulant. Samples
• The correct classification of the patient cell images were analyzed by a cell counter Advia 2120 (Siemens
in one of the following three big groups: normal, reac- Healthcare Diagnosis, Deerfield, IL, USA) and PB films
tive, or abnormal lymphocytes. This allows discern were automatically stained with May–Gr€ unwald–
among malignancy and nonmalignancy. Giemsa in the SP1000i (Sysmex, Kobe, Japan) within
• In case of malignancy, the correct recognition in 4 h of blood collection.
one of the five abnormal lymphocytes considered in
the system.
Digital image acquisition and processing
© 2016 John Wiley & Sons Ltd, Int. Jnl. Lab. Hem. 2016, 38, 209–219
212 S. ALFEREZ ET AL. | AUTOMATIC CLASSIFICATION OF LYMPHOID CELLS
PATTERN RECOGNITION
IMAGE PROCESSING
Preprocessing
Training Feature
Segmentation SVM classification
cell images selection
Feature extraction
The best
The best features
SVM classifier
Figure 1. Block diagram illustrating the steps followed to develop the recognition system. A training set of lymphoid
cell images are processed, including segmentation, to obtain a database of features. A recognition module is built
which includes a classifier based on support vector machines (SVMs), whose input is a reduced number of most
relevant features. The final selection of the best features and the tuning of the best classifier are performed jointly
through an iterative process, involving 10-fold cross-validation, where the quality of the features is determined by
the efficiency of the classifier.
Segmentation is a major step in digital image pro- information that different types of components may
cessing, where cells are separated from other objects offer about the different lymphoid cells, for example,
in the image. In this work, we used a previously other colors such as cyan, magenta, and blue (from
developed algorithm [17], which was proven to be CMYK), or attributes such as hue, saturation and
efficient to automatically separate the three regions of brightness (in HSV), lightness (L), or chromaticity (a,
interest: nucleus, cell, and peripheral zone around the b, u, v). For each region of interest of every seg-
cell. The cytoplasm was identified by the difference mented image, a histogram was calculated for each
between the whole cell and the nucleus. color space component. Histograms display the pro-
portion of pixels on the image with specific intensity
values. For each histogram, six classical first-order sta-
Feature extraction
tistical features were calculated: mean, standard devia-
From the segmented images, it is necessary to identify tion, skewness, kurtosis, energy (uniformity), and
a set of quantitative descriptors that are relevant for entropy 1 (variability).
the morphological analysis and the subsequent classi- Texture refers to spatial patterns of color or intensi-
fication. Usually, three types of descriptors are used: ties, which can be visually observed and quantita-
geometric, color, and texture. Geometric descriptors tively represented using statistical tools. To do this,
are the most intuitive and closely related to the visual the co-occurrence matrix is used, which defines the
patterns that are familiar to pathologists. We extracted probability that pairs of neighbor pixels have similar
13 parameters related to size and shape of both intensities [21]. In practice, this probability is com-
nucleus and cell [16], such as nucleus/cytoplasm puted as the frequency of occurrences (second-order
ratio, nucleus perimeter, cytoplasmic profile, and histogram) divided by the number of neighbor pixels.
others. In this work, histograms were obtained for each color
Color is a physical property very common in the space component, and then, 17 second-order statisti-
visual characterization of blood cells, and different cal descriptors [22] were calculated for each his-
spaces can be used for a quantitative representation. togram.
For instance, in RGB space, each color is decomposed In this study, texture was additionally characterized
in three bands corresponding to basic colors as red, using two more mathematical tools. One is the wave-
green, and blue. In this work, we used six different let transform, which is widely used in image process-
color spaces [20]: RGB, CMYK, XYZ, L*a*b*, L*u*v, ing to analyze trends and fluctuations. The analysis of
and HSV. Our idea was to explore the rich amount of fluctuations in an image serves to characterize texture
© 2016 John Wiley & Sons Ltd, Int. Jnl. Lab. Hem. 2016, 38, 209–219
S. ALFEREZ ET AL. | AUTOMATIC CLASSIFICATION OF LYMPHOID CELLS 213
[23]. In this work, the wavelet transform was used for morphologic visual appearance. All the true diagnoses
each cell image and then 12 statistical measures of the patients were confirmed by the corresponding
(means and standard deviations) were calculated as complementary tests [3]. Images were obtained using
texture features. Granulometry approaches texture the same smear staining and the same acquisition
description by measuring the particle size distribution device as in the development stage, but they were not
in an image by a series of operations within the math- used before in the training. These are two important
ematical morphology [24]. In this work, eight granu- issues to remark, as pattern recognition, involving
lometric features were calculated. training and comparisons with reference characteris-
Finally, 43 features were defined for color and tex- tics, is the essential paradigm in building the recogni-
ture. When considering the six color spaces, we had tion approach.
19 components, so that we had 817 features. As we
applied them separately to nucleus, cytoplasm and the
R E S U LT S
whole cell, a total of 2451 color/texture features were
obtained. All of them were determined for each image
System development
and stored in a numerical data set. Readers may find
more technical details about those features and their As explained before, we obtained 2464 features in the
calculations in [20]. extraction step, 13 of them being geometric and the
remaining color and texture features. Two issues were
particularly important in deriving the classification
Cell recognition
system: (i) the selection of a reduced number of fea-
The objective of this step was to build a pattern recogni- tures and (ii) the tuning of the parameters of the
tion module (see Figure 1) composed by two elements: SVM classifier [20, 25, 26]. Both issues are intercon-
(i) a set of most relevant features selected among those nected, as illustrated by the feedback arrow in Fig-
extracted in the training stage and (ii) an algorithm ure 1, as the quality of the selected features is
(classifier) to perform the automatic identification measured by the accuracy in the classification. Itera-
among normal, reactive lymphocytes, and the five B tive tests were performed where the classifier perfor-
neoplastic lymphoid cell subsets. To face the significant mance was evaluated using the 10-fold cross-
number of lymphoid groups under study, several classi- validation technique [17] over the whole image train-
fication techniques were analyzed, the so-called sup- ing set. At the end of this iterative tuning process, the
port vector machine (SVM) with a radial basis function best 150 features were ranked based on maximizing
kernel [20, 25] being finally selected. their relevance and minimizing their redundancy
The overall classification accuracy and three more [27], and the classifier was finally designed and ready
performance parameters were used for the classifica- to be used in the further experimental study. Among
tion evaluation: (i) sensitivity or true positive rate these features, six were geometric (easily inter-
(TPR), (ii) specificity or true negative rate (TNR), and pretable). The remaining 144 were color and texture
(iii) precision or positive predictive value (PPV). features, which have an abstract interpretation in
relation to the cell morphology.
To give some insight on these features, Table 1 lists
Experimental recognition tests
the 15 most relevant, where nucleus/cytoplasm ratio
After the selection of the best features and the tuning (N/C), nucleus perimeter, cell diameter, and cytoplas-
of the optimal classifier, a recognition system was mic external profile (hairiness) are geometric. The
assembled. The system input is a set of digital cell remaining features are related to color and texture
images obtained from a PB smear of an individual and we may observe that the six color spaces adopted
patient. The system output is the classification of this in this work are involved. For instance, feature 4 is a
set into the different groups under study. To assess statistical parameter describing the uniformity of the
the system effectiveness, a number of 21 patients and histogram of the hue component of the color space
sets of the corresponding images (946 in total) were HSV. Feature 5 is related to texture (granulometry)
selected by the pathologist (AM) based on their and calculated from the XYZ space, while 6 is another
© 2016 John Wiley & Sons Ltd, Int. Jnl. Lab. Hem. 2016, 38, 209–219
214 S. ALFEREZ ET AL. | AUTOMATIC CLASSIFICATION OF LYMPHOID CELLS
© 2016 John Wiley & Sons Ltd, Int. Jnl. Lab. Hem. 2016, 38, 209–219
S. ALFEREZ ET AL. | AUTOMATIC CLASSIFICATION OF LYMPHOID CELLS 215
lymphocytes (RL). This is shown in Table 4. The diag- images corresponding to individual patients were
onal values represent the true positive rates for each automatically identified in the group corresponding to
of the arranged groups: 100% for N, 97.9% for ALC, the true diagnosis.
and 95.7% for RL. The overall cell classification
accuracy was 97.67%.
DISCUSSION
Figure 3 shows individual images from five patients
of the experimental group. The first row shows lym- Recently, the International Council of Standardization
phoid cell images (patient 7) that the system identified in Hematology (ICSH) recommended using the term
as HCL cells (32/32, 100%). The second row shows reactive to describe lymphocytes with morphological
lymphoid cells that the system identified 63/65 (97%) changes related to benign etiology and the term abnor-
as FL cells (patient 9, Table 3), diagnosis confirmed by mal to describe lymphocytes with a suspected malig-
complementary tests. A total of 55/71 of the cell nancy [5]. The morphologic discrimination among
images were classified as CLL cells (77%) and 8/71 neoplastic B lymphomas is a significant added value
were recognized as BPL in patient 12 and some of that may be complementary to other additional tests,
these images are shown in row 3. Fourth row corre- which cannot be interpreted accurately without
sponds to some of the 29/39 lymphoid cell images examining the peripheral blood film [15]. However,
(74%) that were classified as MCL cells (patient 14). these lymphoid cells are the most difficult to classify
Finally, the fifth row corresponds to some of the 29/ using only morphological features [4], and interpreta-
29 lymphoid cell images (100%) that were classified tion is based on individual experience. New tech-
as reactive lymphocytes (patient 19). Most of the cell niques such as automated recognition systems are not
© 2016 John Wiley & Sons Ltd, Int. Jnl. Lab. Hem. 2016, 38, 209–219
216 S. ALFEREZ ET AL. | AUTOMATIC CLASSIFICATION OF LYMPHOID CELLS
© 2016 John Wiley & Sons Ltd, Int. Jnl. Lab. Hem. 2016, 38, 209–219
S. ALFEREZ ET AL. | AUTOMATIC CLASSIFICATION OF LYMPHOID CELLS 217
Table 3. Classification results obtained in the experimental testing of the recognition system. For each single
patient, a number of images were acquired (third column) and classified into the seven different groups. Global
cell classification accuracy = 91.23%
True Cell
Patient diagnosis images Accuracy N HCL FL CLL MCL BPL RL
1 N 16 100.0 16 0 0 0 0 0 0
2 N 14 100.0 14 0 0 0 0 0 0
3 N 11 100.0 11 0 0 0 0 0 0
4 N 15 100.0 15 0 0 0 0 0 0
5 HCL 20 95.0 1 19 0 0 0 0 0
6 HCL 52 100.0 0 52 0 0 0 0 0
7 HCL 32 100.0 0 32 0 0 0 0 0
8 FL 81 70.4 0 0 57 1 23 0 0
9 FL 65 96.9 0 0 63 0 2 0 0
10 FL 73 93.2 1 0 68 2 2 0 0
11 CLL 165 98.2 1 0 2 162 0 0 0
12 CLL 71 77.5 4 1 2 55 1 8 0
13 MCL 13 46.2 1 0 5 1 6 0 0
14 MCL 39 74.4 0 0 1 0 29 7 2
15 BPL 64 95.3 1 0 0 0 2 61 0
16 BPL 53 90.6 3 0 0 0 1 48 1
17 RL 10 100.0 0 0 0 0 0 0 10
18 RL 53 98.1 0 1 0 0 0 0 52
19 RL 29 100.0 0 0 0 0 0 0 29
20 RL 26 88.5 0 0 2 0 0 1 23
21 RL 44 93.2 0 1 1 0 0 1 41
HCL, hairy cell leukemia; MCL, mantle cell leukemia; FL, follicular lymphoma; CLL, chronic lymphocytic leukemia;
BPL, B-prolymphocytes; N, normal lymphocytes; RL, reactive lymphocytes. The bold values represent the number of
true positives predicted by the classifier.
Table 4. Results of Table 3 arranged in three groups: recognize five different types of normal leukocytes but
normal lymphocytes (N), reactive lymphocytes (RL), only one subtype of abnormal lymphoid cells (CLL).
and abnormal lymphoid cells (ALC). The rows
represent the true diagnosis and the columns the
In addition, a method was proposed [13] that was
predicted diagnosis. The diagonal values represent able to recognize five types of blood cells, but the
the global accuracy for each group. The values are in images selected were precursor lymphoid cells and
percentage with respect to the total number of each myeloid blast cells from acute leukemia patients.
(true) type of cell The need for recognition of abnormal lymphocytes
Predicted should be recognized, as some studies have reported a
significant variability in the classification of normal,
N ALC RL reactive, and abnormal lymphocytes. For instance, a
True N 100 0 0 survey using digital cell images [14] showed that 31%
ALC 1.7 97.9 0.4 of the morphologists were not able to reproduce their
RL 0 4.3 95.7 own previous classification. New techniques, in partic-
ular automated recognition systems as the one pre-
sented in this article, may reduce the interobserver
To our knowledge, the high number of lymphoid variation and subjectivity of the classification.
cell subtypes automatically recognized in this work As a future work, other B and T abnormal cells
has not been yet reported in the literature. A previous should be characterized and included in the training and
work [30] investigated the use of SVM classifiers to recognition scheme in view to have a general system
© 2016 John Wiley & Sons Ltd, Int. Jnl. Lab. Hem. 2016, 38, 209–219
218 S. ALFEREZ ET AL. | AUTOMATIC CLASSIFICATION OF LYMPHOID CELLS
able to classify any abnormal lymphocyte in a peripheral covering the whole spectrum of pathologies will be
blood sample. Other subsets such as large granular lym- required.
phoid cells, plasma cells, or neoplastic T cells should be A further step will be to define the minimum pro-
also considered, as well as different lineages of blast cells. portion of abnormal cells of each particular type
The patient-based experimental tests included in detected by the recognition system necessary to accept
this study have served to illustrate the potential the screening result as a guide toward the morpho-
capabilities of the proposed recognition method. A logic diagnosis. To apply this system in the hematol-
further step will be to define the minimum propor- ogy laboratory, a wider number of patients covering
tion of abnormal cells of each particular type the whole spectrum of pathologies will be required. In
detected by the recognition system necessary to addition, our next goal is to test the recognition sys-
accept the screening result as a guide toward the tem without a preselection of the lymphoid cell
morphologic diagnosis. A wider number of patients images of the patient.
REFERENCES 5. Palmer L, Briggs C, McFadden S, Zini G, ysis replace the manual differential? An
Burthem J, Rozenberg G, Proytcheva M, evaluation of the CellaVision DM96 auto-
1. Houwen B. The differential cell count. Lab Machin SJ. ICSH recommendations for the mated image analysis system. Int J Lab
Hematol 2001;7:89–100. standardization of nomenclature and grad- Hematol 2009;31:48–60.
2. Bain BJ. Diagnosis from the blood smear. N ing of peripheral blood cell morphological 8. Briggs C, Culp N, Davis B, D’Onofrio G,
Engl J Med 2005;353:498–507. features. Int J Lab Hematol 2015;37:287– Zini G, Machin SJ. ICSH guidelines for the
3. Swerdlow SH, Campo E, Harris NL, Jaffe 303. evaluation of blood cell analysers including
ES, Pileri SA, Stein H, Thiele J, Vardiman 6. Ceelie H, Dinkelaar RB, van Gelder W. those used for differential leucocyte and
JW. WHO Classification of Tumours of Examination of peripheral blood films reticulocyte counting. Int J Lab Hematol
Haematopoietic and Lymphoid Tissues. using automated microscopy; evaluation of 2014;36:613–27.
Lyon, France: IARC Press; 2008. Diffmaster Octavia and Cellavision DM96. J 9. Foran DJ, Comaniciu D, Meer P, Goodell
4. Gutierrez G, Merino A, Domingo A, Jou Clin Pathol 2007;60:72–9. LA. Computer-assisted discrimination
JM, Reverter JC. EQAS for peripheral blood 7. Briggs C, Longair I, Slavik M, Thwaite K, among malignant lymphomas and leuke-
morphology in Spain: a 6-year experience. Mills R, Thavaraja V, Foster A, Romanin D, mia using immunophenotyping, intelligent
Int J Lab Hematol 2008;30:460–6. Machin SJ. Can automated blood film anal- image repositories and telemicroscopy.
© 2016 John Wiley & Sons Ltd, Int. Jnl. Lab. Hem. 2016, 38, 209–219
S. ALFEREZ ET AL. | AUTOMATIC CLASSIFICATION OF LYMPHOID CELLS 219
IEEE Trans Inf Technol Biomed tal blood image processing. Int J Lab 24. Angulo J. A mathematical morphology
2000;4:265–73. Hematol 2014;36:472–80. approach to cell shape analysis. In: Progress
10. Sabino DMU, da Fontoura Costa L, Rizzatti 17. Alferez S, Merino A, Bigorra L, Mujica L, in Industrial Mathematics at ECMI 2006.
EG, Zago MA, Gilrizzatti E. A texture Ruiz M, Rodellar J. Automatic recognition Bonilla LL, Moscoso M, Platero G, Vega JM
approach to leukocyte recognition. Real- of atypical lymphoid cells from peripheral (eds). Berlin, Heidelberg: Springer, 2008: 2–6.
Time Imaging 2004;10:205–16. blood by digital image analysis. Am J Clin 25. Steinwart I, Christmann A. Support Vector
11. Angulo J, Klossa J, Flandrin G. Ontology- Pathol 2015;143:168–76. Machines. New York: Springer; 2008.
based lymphocyte population description 18. Gonzalez RC, Woods RE. Digital Image Pro- 26. Chang C-C, Lin C-J. LIBSVM. ACM Trans
using mathematical morphology on colour cessing (3rd Edition), 3rd edn. Upper Sad- Intell Syst Technol 2011;2:1–27.
blood images. Cell Mol Biol 2006;52:2–15. dle River, NJ: Prentice Hall; 2007. 27. Brown G, Pocock A, Zhao M-J, Lujan M.
12. Tuzel O, Yang L, Meer P, Foran DJ. Classi- 19. Bishop CM. Pattern Recognition and Conditional likelihood maximization: a uni-
fication of hematologic malignancies using Machine Learning. New York: Springer; fying framework for information theoretic
texton signatures. Pattern Anal Appl 2007; 2006. feature selection. J Mach Learn Res
10:277–90. 20. Alferez S. Methodology for Automatic Clas- 2012;13:27–66.
13. Yang L, Tuzel O, Chen W, Meer P, Salaru sification of Atypical Lymphoid Cells from 28. Merino A, Brugues R, Garcıa R, Kinder M,
G, Goodell LA, Foran DJ. PathMiner: a Peripheral Blood Cell Images. Doctoral The- Torres F, Escolar G. Estudio comparativo de
Web-based tool for computer-assisted diag- sis, Universitat Politecnica de Catalunya, la morfologıa de sangre periferica analizada
nostics in pathology. IEEE Trans Inf Tech- Barcelona, Spain, 2015. mediante el microscopio y el CellaVision
nol Biomed 2009;13:291–9. 21. Haralick RM, Shanmugam K, Dinstein I. DM96 en enfermedades hematol ogicas y
14. van der Meer W, van Gelder W, de Keijzer Textural features for image classification. no hematol ogicas (in Spanish). Rev del Lab
R, Willems H. The divergent morphological IEEE Trans Syst Man Cybern 1973;3:610–21. Clınico 2011;4:3–14.
classification of variant lymphocytes in 22. Albregtsen F. Statistical Texture Measures 29. Merino A. Manual de Citologıa de Sangre
blood smears. J Clin Pathol 2007;60:838–9. Computed From Gray Level Coocurrence Periferica (in Spanish). Madrid: Grupo
15. Hern andez AM. Peripheral blood manifes- Matrices. Image Processing Laboratory, Accion Medica; 2005.
tations of lymphoma and solid tumors. Clin Department of Informatics, University of 30. Ushizima DM, Lorena AC, de Carvalho
Lab Med 2002;22:215–52. Oslo, Norway, 1995. ACPLF. Support vector machines applied to
16. Alferez S, Merino A, Mujica LE, Ruiz M, 23. Arivazhagan S, Ganesan L. Texture classifi- white blood cell recognition. In: Fifth Inter-
Bigorra L, Rodellar J. Automatic classifica- cation using wavelet transform. Pattern national Conference on Hybrid Intelligent
tion of atypical lymphoid B cells using digi- Recognit Lett 2003;24:1513–21. Systems (HIS’05). IEEE; 2005:6.
© 2016 John Wiley & Sons Ltd, Int. Jnl. Lab. Hem. 2016, 38, 209–219