You are on page 1of 21

Expert Systems With Applications 202 (2022) 117238

Contents lists available at ScienceDirect

Expert Systems With Applications


journal homepage: www.elsevier.com/locate/eswa

Review

Classification of non-Hodgkin lymphomas based on sample entropy


signatures
Guilherme Botazzo Rozendo a ,∗, Marcelo Zanchetta do Nascimento b , Guilherme Freire Roberto b ,
Paulo Rogério de Faria d , Adriano Barbosa Silva b , Thaína Aparecida Azevedo Tosta c , Leandro
Alves Neves a
a
Department of Computer Science and Statistics (DCCE), São Paulo State University (UNESP), Rua Cristóvão Colombo, 2265, 15054-000, São José do Rio Preto,
São Paulo, Brazil
b
Faculty of Computer Science (FACOM) - Federal University of Uberlândia (UFU), Avenida João Naves de Ávila 2121, Bl.B, 38400-902, Uberlândia, Minas
Gerais, Brazil
c
Science and Technology Institute, Federal University of São Paulo (UNIFESP), Avenida Cesare Mansueto Giulio Lattes, 1201, 12247-014, São José dos Campos,
São Paulo, Brazil
d Department of Histology and Morphology, Institute of Biomedical Science, Federal University of Uberlândia (UFU), Av. Amazonas,

S/N, 38405-320, Uberlândia, MG, Brazil

ARTICLE INFO ABSTRACT

Keywords: Computational systems to provide studies and diagnoses of non-Hodgkin’s lymphomas have been increasingly
SampEn signatures developed to assist specialists in their decision-making. On the other hand, the approaches have not yet
Signature metrics fully explored the sample entropy to assess the disease’s characteristics, with indications of the best features
Non-Hodgkin’s lymphomas
and techniques to distinguish the main groups of lymphomas. Herein, we present a method that considers
Segmentation
sample entropy signatures to study chronic lymphocytic leukemia, follicular lymphoma, and mantle cell
Color normalization
lymphoma, involving images after applying multiple segmentation techniques and color normalization models.
Texture signatures were defined as feature curves obtained from multiple observations, by associating different
parameters. Each signature was calculated for windows of size 𝑚 and ranged from 1 to 4 (0.06–0.40)
tolerance levels (𝑟). The behavior of each signature was determined from the area under the curve, skewness,
maximum entropy value and area ratio, representing the signature metrics. This approach aimed to improve
the quantitative ability of this descriptor, with new interpretations and associations. Thus, all features were
provided for different functions, such as lazy learning, trees, genetic evolution, particle swarm and animal
behavior-based approaches. The best association was tested after adding noise levels of 20%, 40% and 80%.
The performances achieved with the proposal were accuracy rates between 98.72% and 99.60%. We were
able to obtain an accuracy of 98.84% for multi-class classification using only ten features, which is an amount
considerably lower than most of the state-of-the-art approaches. The proposed approach presents relevant
advances for the investigation of computer-aided diagnosis systems since it provides accurate results and
does not require preprocessing steps. Moreover, signature metrics in association with evolutionary approaches,
which provided the best results, are among the main contributions of this paper, since the details of the most
relevant features with a full understanding of the solutions are useful for the study and pattern recognition of
non-Hodgkin’s lymphomas.

1. Introduction

Non-Hodgkin’s lymphomas (NHL) are malignant tumors that affect


cells of the lymphatic system. The American Cancer Society estimates

The code (and data) in this article has been certified as Reproducible by Code Ocean: (https://codeocean.com/). More information on the Reproducibility
Badge Initiative is available at https://www.elsevier.com/physical-sciences-and-engineering/computer-science/journals.
∗ Corresponding author.
E-mail addresses: guilherme.botazzo@unesp.br (G.B. Rozendo), marcelo.zanchetta@gmail.com (M.Z. do Nascimento), guilhermefroberto@gmail.com
(G.F. Roberto), paulo.faria@ufu.br (P.R. de Faria), adrianobs@gmail.com (A.B. Silva), tosta.thaina@gmail.com (T.A.A. Tosta), leandro.neves@unesp.br
(L.A. Neves).

https://doi.org/10.1016/j.eswa.2022.117238
Received 18 January 2021; Received in revised form 6 April 2022; Accepted 10 April 2022
Available online 19 April 2022
0957-4174/© 2022 Elsevier Ltd. All rights reserved.
G.B. Rozendo et al. Expert Systems With Applications 202 (2022) 117238

that 80,470 new cases of NHL will be diagnosed in 2022 and 20,250 providing solutions with a more complete understanding of the classifi-
NHL deaths are estimated in the United States (American Cancer Soci- cation process, as well as a faster and more accurate diagnosis (Ribeiro
ety, 2022a). In Brazil, 36,090 new cases of NHL are expected to be et al., 2018). In this context, investigations have contributed to the
diagnosed throughout the three-year period spanning from 2020 to improvement of CAD systems devoted to NHL images and supporting
2022 (Instituto Nacional de Câncer, 2022). specialists in their decision-making (Bai et al., 2019; Battula & Sharma,
The World Health Organization classifies NHL according to lym- 2018; Brancati et al., 2019; Candelero et al., 2020; do Nascimento
phocyte types and histological characteristics of tissues affected by et al., 2018; Jiang et al., 2018; Martins et al., 2019; Ribeiro et al.,
the disease, in addition to chromosomal information and the presence 2018; Roberto et al., 2021; Taino et al., 2020; Tambe et al., 2019).
of specific proteins within cell membranes (American Cancer Society, These methods have used different combinations of feature descriptors
2022b; Swerdlow et al., 2016). Among NHL types, chronic lymphocytic and classification techniques, such as Fourier and wavelet transforms,
leukemia (CLL), follicular lymphoma (FL) and mantle cell lymphoma multiscale and multidimensional approaches, morphological and non-
(MCL) are particularly important. CLL is characterized by slow prolif- morphological models, fractal analysis and deep learning strategies.
eration of lymph node B and 𝑇 cells, which are present in the blood and Part of these approaches explores texture which is determined by the
the bone marrow (American Cancer Society, 2022b; Hallek, 2017). FL spatial arrangement of the intensity levels of pixels.
occurs both in lymph nodes and bone marrow, which is composed of Regarding the importance of the previously mentioned descrip-
two types of B-cells derived from its germinal center: centrocytes and tors, features based on sample entropy (SampEn) have not been fully
centroblasts (American Cancer Society, 2022b; Xerri et al., 2016). The explored to assess NHL images (Humeau-Heurtier, 2019), although en-
MCL group proliferates throughout the lymph nodes, spleen, blood and tropy models are considered relevant for quantifying patterns in digital
bone marrow (Vose, 2017). images (Azami, da Silva, Omoto, & Humeau-Heurtier, 2019; da Silva,
The diagnosis of the NHL type is essential to define the most appro- da Silva Senra Filho, Fazan, Felipe, & Murta, 2014; dos Santos et al.,
priate treatment protocol for each patient. Diagnosis can be performed 2018; Fabbri, Goncalves, Lopes, & Bruno, 2012; Gaudêncio et al., 2020;
via a biopsy of the most accessible tumor. The confirmation of a NHL Haralick, Shanmugam, et al., 1973; Humeau-Heurtier, 2019, 2021;
type can be accomplished via microscopic evaluation of hematoxylin Morel & Humeau-Heurtier, 2021; Silva, Senra Filho, Fazan, Felipe, &
and eosin (H&E) stained samples by pathologists. Visual analysis is Junior, 2016; Yeh, Lin, & Shieh, 2011). In addition, this descriptor can
a complex task due to the significant time involved, its subjectivity be applied to analyze and classify NHL images by providing a measure
and pathological interpretation variability between specialists during of disorder of the histological structures that represent the groups under
the manual diagnosis stage (Albayrak et al., 2021; Ramachandran, investigation, considering multiple segmentation and color normaliza-
Strisciuglio, Vinekar, John, & Azzopardi, 2020). Computational vision tion techniques. In ordered systems, for instance, entropy is equal to
algorithms and image processing techniques employed in the computer- zero, and as the degree of disorder increases, entropy also increases. In
aided diagnosis system (CAD) contribute to the diagnosis stage of digital images, the disorder is related to texture (Azami et al., 2019;
digitized images of histological samples of NHL (Bai, Jiang, Li, & Ma, Humeau-Heurtier, 2019, 2021; Morel & Humeau-Heurtier, 2021). This
2019; Battula & Sharma, 2018; Brancati, De Pietro, Frucci, & Riccio, type of investigation has not been fully detailed in the context of NHL
2019; Candelero et al., 2020; do Nascimento, Martins, Tosta, & Neves, images, with the indications of the main contributions and limitations
2018; Jiang, Li, Li, & Zhou, 2018; Li et al., 2022; Martins et al., 2019; of the SampEn for distinguishing the lymphoma groups.
Ribeiro et al., 2018; Roberto, Lumini, Neves, & do Nascimento, 2021; Studies have shown that there are several challenges associated with
Taino et al., 2020; Tambe, Mahajan, Shah, Agrawal, & Garware, 2019; the use of SampEn for quantifying and classifying patterns. The capacity
Zhang, Zhang, Jiang, & Yang, 2021). of SampEn depends mostly on establishing the best combinations of its
A CAD system can consider different stages, such as image prepro- parameters, especially image sampling windows (sized 𝑚×𝑚) and a tol-
cessing, segmentation, feature extraction and classification, in order to erance level (𝑟) in order to indicate the differences between all pixels of
allow greater objectivity and less variability in the process of image two windows (da Silva Senra Filho et al., 2019; dos Santos et al., 2018;
analysis. Each stage brings important challenges for the appropriate Humeau-Heurtier, 2019; Richman & Moorman, 2000; Silva, Duque,
development of this type of system. For instance, NHL images obtained Felipe, Murta, & Humeau-Heurtier, 2018; Silva et al., 2016; Yentes
from the biopsy sample may show non-uniform staining due to the et al., 2013). This difficulty can be minimized by using the parameters
H&E process (Feng, Zhang, & Yi, 2018; Jothi & Rajam, 2017). Thus, 𝑚 and 𝑟 to define texture signatures based on SampEn, a strategy that
this problem can be minimized during the preprocessing stage by has not been entirely explored with this type of entropy, especially
applying color normalization techniques (Khan, Rajpoot, Treanor, & when NHL images are considered (Li et al., 2022). This approach avoids
Magee, 2014; Macenko et al., 2009; Reinhard, Adhikhmin, Gooch, & the need to indicate a single combination of 𝑚 and 𝑟 for texture analysis,
Shirley, 2001; Vahadane et al., 2016). This preprocessing step can in addition to expanding the quantitative capacity of SampEn and en-
slightly increase the average accuracy of texture classification (Cer- couraging new interpretations for this descriptor. It is noticed that the
nadas, Fernández-Delgado, González-Rufino, & Carrión, 2017). Also, concept of texture signatures has been used and interpreted in different
segmentation approaches are commonly applied to identify the region ways, mainly in the context of fractal techniques (Backes & Bruno,
of interest in lymphoma images (De Oliveira et al., 2013; Silva et al., 2013; Backes, Casanova, & Bruno, 2012; Căliman & Ivanovici, 2012;
2019; Tang, Li, & Xu, 2018; Tokime, Elassady, & Akhloufi, 2018; Tosta, Dong, 2008; Peleg, Naor, Hartley, & Avnir, 1984). Texture signatures
Faria, Neves, & do Nascimento, 2017; Vahadane & Sethi, 2013; Wienert are useful for organizing multiple quantifications as feature curves. This
et al., 2012), such as lymphocytes and cell nuclei, in order to allow a strategy can contribute to the pattern recognition of complex medical
correlation with the disease under investigation. In this context, a study images, such as for distinguishing the lymphoma groups. Moreover,
of the most appropriate approaches, capable of providing accurate re- the discriminative power of SampEn signatures has not been verified,
sults for the improvement of subsequent stages, especially the features especially considering the different modules that support the devel-
extraction and classification steps, comprise an important challenge in opment of CAD system, such as normalization strategies (Khan et al.,
the study of NHL images. In addition, a model considering a reduced 2014; Macenko et al., 2009; Reinhard et al., 2001; Vahadane et al.,
number of features can lead to a more comprehensive solution for the 2016), segmentation techniques (De Oliveira et al., 2013; Silva et al.,
community interested in the study of NHL. 2019; Tang et al., 2018; Tokime et al., 2018; Tosta, Faria, et al., 2017;
In order to present feasible solutions to the aforementioned chal- Vahadane & Sethi, 2013; Wienert et al., 2012) and tests with distinct
lenges, feature extraction and classification stages comprise an impor- and elevated types of noise (Feng et al., 2018; Zhu & Wu, 2004). These
tant part of the CAD systems. These steps are useful for associating advances may encourage the study of new solutions for each stage of a
the intrinsic properties of the regions of interest with the pathologies, CAD system, both for NHL samples and other contexts of H&E images.

2
G.B. Rozendo et al. Expert Systems With Applications 202 (2022) 117238

In this paper, a method based on SampEn signatures and association 2.1. Sample entropy
with different machine learning algorithms is proposed to study the
NHL cancer groups, involving images after applying multiple segmen- Different approaches have been presented to explore entropy in
tation techniques and color normalization models. SampEn signatures different contexts (Gaudêncio et al., 2020; Humeau-Heurtier, 2019,
consisted in feature curves wherein each point represents a quantifica- 2021; Morel & Humeau-Heurtier, 2021). One such technique is named
tion obtained through the execution of SampEn with a configuration sample entropy (SampEn). SampEn was designed to measure irregu-
larity in short and noisy temporal series (Song, Crowcroft, & Zhang,
of the parameters 𝑚 and 𝑟. The values for 𝑚 and 𝑟 were evaluated in
2012). This approach was applied to assess one-dimensional signal
the range from 1 to 4 and 0.06 to 0.4, respectively. Each signature
studies, such as heartbeat variability (Costa, Goldberger, & Peng, 2002),
was analyzed with the area under the curve, skewness, maximum
electroencephalogram data (Song et al., 2012) and atrial fibrillation via
entropy and area ratios attributes, representing the signature metrics,
electrocardiogram (Alcaraz & Rieta, 2010). By taking into consideration
a strategy that had not been explored in the literature. The attributes these studies, the SampEn was able to quantify the complexity, or irreg-
were employed to elaborate the feature vectors which were provided ularity, of a signal by sampling of similar standards contained therein.
as inputs for distinct classifiers. The model was tested by considering Moreover, the irregularity was inherently related to unpredictability.
NHL images obtained from segmentation techniques (De Oliveira et al., Thus, for regular signals, patterns are predictable, and consequently,
2013; Silva et al., 2019; Tang et al., 2018; Tokime et al., 2018; Tosta, entropy is close to zero. On the other hand, the unpredictability in-
Faria, et al., 2017; Vahadane & Sethi, 2013; Wienert et al., 2012) and creases with the irregularity and, therefore, the entropy also increases.
color normalization approaches (Khan et al., 2014; Macenko et al., It is important to notice that the SampEn concept has been expanded
2009; Reinhard et al., 2001; Vahadane et al., 2016). Finally, the dis- and applied to images (da Silva et al., 2014; da Silva Senra Filho et al.,
criminative power of the best association, using a reduced number of 2019; dos Santos et al., 2018; Humeau-Heurtier, 2019, 2021; Silva
features, was evaluated under some conditions (after applying different et al., 2018, 2016; Yeh et al., 2011). However, SampEn signatures and
types of noise, in addition to comparisons with previous works and signature metrics were not explored in these works (Li et al., 2022).
other texture attributes). Then, this study has evaluated these metrics to investigate texture
irregularities applied in NHL images.
This approach will contribute to the diagnosis stage of NHL histo-
Regarding two-dimensional SampEn, texture irregularity can be
logical images, providing the following assessments: calculated from image sampling windows (sized 𝑚 × 𝑚), which are used
to measure the similarity of evaluated windows (Yeh et al., 2011). This
1. A detailed experimental analysis dedicated to improving tech- method is performed by setting a window at a point on an image and
niques for CAD systems on NHL images including color nor- comparing correspondent pixels within a sliding window that scans
malization, segmentation, feature extraction and classification the whole image by incrementally adjusting its position by one unit.
approaches; If the differences observed between all pixels of two windows are
2. Association of SampEn signatures and different classifiers for smaller than a tolerance parameter 𝑟, then the windows are considered
evaluating H&E images from an important dataset that is com- similar. SampEn is calculated by determining the logarithmic ratio of
monly investigated in the literature; the average quantity of similar patterns observed in windows 𝑚 + 1
3. Evaluation of the discriminative power of SampEn signatures on and 𝑚. If the average quantity of similar patterns in 𝑚 + 1 is equal to
histological lymphoma images, involving images obtained after the average amount determined for 𝑚, then the patterns are considered
applying multiple segmentation techniques and color normaliza- predictable and the SampEn value is defined as zero. As the number
tion models; of similar patterns decreases, the predictability of the patterns also
4. Enhancement of the quantitative ability of sample entropy with decreases and SampEn values increase. This type of image verification
shows that if the analyzed texture is uniform, the subsequent state
evaluations based on signature metrics;
(𝑚 + 1) will be predictable in relation to the previous state (𝑚), and
5. Identification of the best SampEn signature metric such as area
consequently, entropy is close to zero. This process can lead to a single
under the curve, skewness, maximum point and area ratio for
combination of 𝑚 and 𝑟 for texture analysis and limit the quantification
achieving pattern recognition and discriminative power with
and interpretation of complex images (da Silva et al., 2014; da Silva
regard to CLL, FL and MCL groups; Senra Filho et al., 2019; dos Santos et al., 2018; Humeau-Heurtier,
6. Definition of an approach that best separates the NHL cancer 2019, 2021; Silva et al., 2018, 2016; Yeh et al., 2011). On the other
groups, even when tests with elevated noise levels are applied hand, the application of SampEn in the texture analysis of digital
to the signature metrics attributes. images has advantages, mainly because the process of pattern sampling
using a sliding window is useful for quantifying the spatial distribution
This paper is organized as follows. Section 2 presents an overview of of pixels, a characteristic that is not present in methods based on
the works related to our proposal. Section 3 describes the methodology histograms, such as Shannon’s entropy approach (Wibmer et al., 2015).
for the development of the model. Section 4 indicates the tests and the In order to improve and explore the discriminative power of Sam-
results achieved in each combination. A detailed discussion is presented pEn, different strategies have been presented. For instance, SampEn can
in Section 5, considering the best combination and different conditions. be used for separating the epithelial nuclei from the stroma in H&E
Finally, we conclude the paper in Section 6, with an overview of the stained images (Ali et al., 2020), in addition to presenting relevant
obtained results and suggestions for future researches. results in different contexts.
Yeh et al. (2011) proposed an extension of the multiscale Sam-
pEn of Costa et al. (2002) for the context of digital images. In their
2. Related work: An overview method, called multiscale complexity, the coarse-grained process was
used as a multiscale strategy. The authors proposed a strategy using
a subset of randomly selected windows for computing SampEn faster.
Although the CAD systems have been presented to support special- SampEn was computed on each scale and, the scale that provided
ists in their decision-making regarding NHL groups, the application of the highest differentiation between lymphoma classes was determined
SampEn signatures, signature metrics and combination with different using the ANOVA and the Student–Newman–Keuls test. The authors
machine learning algorithms, commonly used in the different modules concluded that SampEn is a relevant technique for complex medical
of a CAD strategy, have not been explored in the literature. image analysis.

3
G.B. Rozendo et al. Expert Systems With Applications 202 (2022) 117238

Both da Silva et al. (2014) and Silva et al. (2016) applied SampEn distance using compound hierarchy of algorithms representing mor-
in rat sural nerve images. The authors explored several values for the phology (WND-CHARM) classifier provided the best result, which was
SampEn parameters and, the AUC was used as a criterion to select the 99% accurate. Nascimento, Neves, Duarte, Duarte, and Batista (2015)
combination of parameters that provided the highest power discrimi- performed the classification of NHL classes using the stationary wavelet
nation. The authors indicated that SampEn is robust for the analysis transform as a characteristic quantifier. The authors used the ANOVA
of small images. It can be useful in histological image studies, mainly method to remove possible noise and the support vector machine (SVM)
because it was possible to detect different properties of the image that classifier, obtaining 100% of accuracy.
were not directly related to the morphometry of the objects. Color, texture and visual descriptors were also explored in the liter-
dos Santos et al. (2018) proposed a multidimensional and multiscale ature for the investigation of NHL cancer. Meng, Lin, Shyu, and Chen
version of the SampEn associated with the fuzzy strategy to classify col- (2010) developed a quantification strategy that consisted of dividing
orectal cancer images. The technique considered different window sizes each NHL image into 25 sub-images and extracting 12,625 features
and different tolerance values to implement the multiscale approach. related to color and texture from each sub-image. The chi-square
The multidimensional approach was implemented considering each feature selection method was applied to select the 50 most relevant
image pixel as a 𝑛-dimensional color vector and the fuzzy approach was attributes and, the accuracy obtained was 92.70%. Song, Cai, Huang,
used to evaluate the relevance of the distances between the pixels of Feng, Wang, and Chen (2016) proposed a method capable of extracting
the windows. The authors concluded that this association of techniques visual attributes from grayscale NHL images and analyzing the results
with the logistic classifier allowed an AUC of 0.983, an important rate from different classifiers. The best result was accuracy of 96.80%
given the difficulties of clinical practice for the diagnosis of this type with the SVM classifier. Codella, Moradi, Matasar, Sveda-Mahmood,
of cancer. and Smith (2016) presented a technique to obtain twelve descriptors
Raju, Pabboju, and Rao (2020) combined the Dual-Tree M-Band from segmented NHL images, considering different spatial and color
wavelet transform with SampEn to extract features in brain MRI im- configurations. The authors also extracted visual descriptors from the
ages. The authors aimed to separate the images between normal and images. The SVM classifier was applied and the result was an accuracy
abnormal categories. The Naïve Bayes classifier was used. The method rate of 95.50%.
provided 97% accuracy, 98% specificity and 96% sensitivity. In addition to the techniques previously presented, Roberto et al.
Despite the advances presented previously, the discriminative abil- (2017) described a method based on percolation theory combined
ity of SampEn to quantify distinct patterns depends on establishing with multiscale and multidimensional strategies. The method analyzed
the best combinations of its parameters, such as the parameters 𝑚 and different combinations of features, segmentation methods and, classi-
𝑟 (da Silva Senra Filho et al., 2019; dos Santos et al., 2018; Humeau- fiers. The best result was an accuracy rate of 96.40%. do Nascimento
Heurtier, 2019; Richman & Moorman, 2000; Silva et al., 2018, 2016; et al. (2018) also performed the classification of CLL, FL and MCL
groups, extracting morphological and non-morphological features of
Yentes et al., 2013). A possible solution considers the strategy based
the cell nuclei. The authors applied several selection and classification
on texture signature, which has been widely applied in the context of
techniques. The best accuracy obtained was 100%. Ribeiro et al. (2018)
fractal techniques (Backes & Bruno, 2013; Backes et al., 2012; Căliman
proposed the combination of fractal, curvelet and, Haralick features
& Ivanovici, 2012; Dong, 2008; Peleg et al., 1984; Roberto et al.,
to classify the NHL groups. The authors tested several classifiers such
2017). For instance, Backes et al. (2012) combined signatures with
as decision tree, random forest, SVM, Naive Bayes and K*, and it was
the fractal dimension descriptor. The authors indicated that signatures
possible to obtain an accuracy rate of 86.14%.
provided a richness of image texture detail, which allowed more ac-
Also, Brancati et al. (2019), Janowczyk and Madabhushi (2016),
curate quantification. The signatures from images were also applied
Tambe et al. (2019), Zhang et al. (2021) and Bai et al. (2019) used deep
in association with neural networks (Junior, Backes, & Bruno, 2018).
learning to classify NHL groups. The authors used convolutional neural
The signatures were used to describe the color textures of the images.
networks (CNN) and hand-crafted features to extract and classify image
The authors performed tests on several image databases and, the results
patterns. Janowczyk and Madabhushi (2016) used an Alexnet model
were promising. The classification performance of the proposed tech-
with classification errors most commonly on low-quality images. Tambe
nique outperformed most of the methods compared, indicating that the
et al. (2019) performed the classification through an Inception V3
technique can be applied in different contexts. Thus, when the SampEn
network with hyperparameters tuning. Brancati et al. (2019) used an
is considered, texture signatures can be defined through the parameters
autoencoder with symmetric architecture and skip connections. This
𝑚 and 𝑟 in order to improve the quantitative capacity of this descriptor,
approach reached higher accuracy than U-net and ResNet. In contrast,
with new interpretations.
the ResNet50 model was used by Zhang et al. (2021) with image
These indications have motivated us to investigate a combination
transformations for obtaining sufficient data for training, validation
able to improve the classification of NHL images with tests performed
and test of the networks. Bai et al. (2019) used a modified version of
under different conditions. This strategy has not been fully applied as GoogLeNet with statistical, texture and color features. Among all these
in our proposal (Li et al., 2022), mainly with the analysis of tech- studies, it was possible to obtain accuracy rates between 96.58% and
niques present in different modules of a CAD system: normalization 99.33%.
approaches (Khan et al., 2014; Macenko et al., 2009; Reinhard et al., Battula and Sharma (2018) presented a review of published works
2001; Vahadane et al., 2016); segmentation techniques (De Oliveira of lymphoma histological images classification. This study highlight the
et al., 2013; Silva et al., 2019; Tang et al., 2018; Tokime et al., 2018; feature extraction by wavelet and Fourier transforms, Gabor filter and
Tosta, Faria, et al., 2017; Vahadane & Sethi, 2013; Wienert et al., 2012); gray level co-occurrence matrix. This matrix was also used by Taino
distinct and elevated types of noise (Feng et al., 2018; Zhu & Wu, 2004). et al. (2020) along with percolation, lacunarity and fractal features.
The proposed methodology used a genetic algorithm for selecting the
2.2. NHL classification systems best combination of features, selection algorithms, and classifiers. The
evolutionary algorithm allowed the selection of 73 features through
Methods for CAD systems have been developed to achieve the best the Relief method with the k-nearest neighbors, reaching an area
distinctions in the NHL categories, using multiple techniques (Li et al., under the curve of 0.947. Jiang et al. (2018) proposed a hierarchical
2022). approach with images divided into 130 patches, with no overlapping.
Orlov et al. (2010) applied Fourier and Wavelet transforms as The feature extraction was made by sparse autoencoder and texture
feature quantifiers in NHL images. According to the authors, the quan- representations. The classification reached a 97.96% of accuracy with
tification performed in the H&E color space with the weighted neighbor two steps: the first one separated FL from the other lymphomas samples

4
G.B. Rozendo et al. Expert Systems With Applications 202 (2022) 117238

through an extreme learning machine, and the second one divided the 3.1. Datasets: NHL images
samples in CLL and MCL by using SVM algorithm.
Fractal features were also used by recent studies of lymphoma The proposed method was evaluated using NHL images from a
images. Martins et al. (2019) applied these features with lacunarity, public dataset, here identified as original H&E histological images,
analyzing different color models. The classification was performed by provided from studies carried out by researchers at the National Cancer
Hermite polynomial with an accuracy of 97.60%. Candelero et al. Institute and National Institute on Aging, both located in the United
(2020) also performed a lymphoma images classification with fractal States (National Cancer Institute, 2022; National Institute on Aging,
and CNN features, but also with texture representations. The best 2022; Shamir, Orlov, Eckley, Macura, & Goldberg, 2008). This is the
CNN features were selected by the ReliefF and evolutionary methods. main dataset explored by important techniques available in the liter-
The proposed methodology reached a maximum accuracy of 82.01% ature (Bai et al., 2019; Brancati et al., 2019; Codella et al., 2016; do
with K* classifier and the genetic algorithm. In comparison with other
Nascimento et al., 2018; Janowczyk & Madabhushi, 2016; Meng et al.,
analyzed images, it was possible to notice that these features better
2010; Nascimento et al., 2015; Orlov et al., 2010; Ribeiro et al., 2018;
represent breast cancer and colorectal cancer images. Roberto et al.
Roberto et al., 2021, 2017; Song et al., 2012; Taino et al., 2020; Tambe
(2021) classified histological images with CNN and fractal features. To
et al., 2019). The dataset was defined from 30 histological slides of
do so, 300 fractal features were scaled and offered as input for the CNN.
lymph nodes stained with H&E, with ten cases for each group: FL;
Besides, another CNN was used with the original images as inputs. The
class probabilities of both networks were used for the definition of the CLL; and, MCL. Based on the characteristics of each group, to better
images classes. The classification obtained an accuracy of 95.55%, with represent clinical practice, instead of the strictly controlled laboratory
future possible parameters optimizations. environment, the slides were obtained with significant variations in
Even with the advances previously presented to support the devel- sectioning and staining, which can represent challenges for image
opment of CAD systems dedicated to the context of NHL images, the processing methods.
methods were defined with techniques based on Fourier and wavelet For this dataset, the microscopic images were digitally obtained
transforms, morphological and non-morphological models, fractal anal- using a light microscope (Zeiss Axioscope) with a 20× objective and
ysis and deep learning strategies, for instance. However, the texture a color digital camera (Axio Cam MR5) attached. Regions of inter-
irregularities and measure of the disorder of the histological structures est for each slide were selected by specialists, digitally photographed
have not been verified through SampEn signatures and signature met- and recorded without compression, RGB color model, resolution of
rics, or even considering the descriptor of the behavior with multiple 1388 × 1040 pixels. In total, 375 images were generated, contain-
segmentation and color normalization techniques. These insights and ing 113, 140, and 122 cases of CLL, FL, and MCL, respectively. In
those previously mentioned (Section 2.1) have been the main moti- Fig. 2, examples of images corresponding to each of the groups are
vations for this study. This type of investigation can provide a full provided. It is important to highlight that the RGB color model was
understanding of solutions to distinguish NHL groups, expanding the also used to represent the results obtained in the segmentation and
quantitative capacity of SampEn and providing valuable information normalization methods. Then, for the feature extraction step, each RGB
for improving CAD systems, such as: association of SampEn signatures image was converted into a corresponding grayscale image, with an 8-
and different classifiers for evaluating H&E images; evaluation of the bit quantization rate, in order to allow the application of the SampEn
discriminative ability of SampEn signatures on NHL images, after apply-
technique.
ing multiple segmentation techniques and color normalization models;
evaluations based on signature metrics, enhancing the quantitative
3.1.1. Investigation considering segmentation techniques
capacity of the SampEn; indications of the best features, with tests
In addition to the original NHL images, the proposal was tested
under different conditions.
on the results provided by segmentation techniques. For this pur-
3. Methods pose, the NHL images were segmented both by a specialist (gold
standard) (Tosta, Faria, et al., 2017) and using methods available in the
The proposed method was divided into four main phases, as illus- literature, which included the approaches explored by De Oliveira et al.
trated in Fig. 1. In order to investigate the performance of this proposal, (2013), Vahadane and Sethi (2013), Wienert et al. (2012) and, Tosta,
the first phase (Datasets: NHL images, Section 3.1) defines the three Faria, et al. (2017). In addition to these previous works, we have also
histological datasets, with the original NHL images and the achieved applied recent approaches to perform the segmentation of the NHL
results with multiple segmentation and color normalization techniques. images, such as Tang et al. (2018), Tokime et al. (2018) and, Silva
The second phase (Feature extraction: sample entropy, Section 3.2) et al. (2019). The methods were applied to define regions of interest
performs the extraction of SampEn values, using distinct associations and investigate how these can influence the classification process of
of values for the parameters 𝑚 and 𝑟. From the SampEn values, the 𝑚 NHL groups. We chose these methods because they were specifically de-
signatures curves with 𝑟 points were constructed. Finally, the attributes signed and/or tested to study nuclear structures in histological images
area, skewness, maximum entropy point and area ratio were obtained stained with H&E.
from each SampEn signature, representing the signature metrics. The The methods based on (De Oliveira et al., 2013; Silva et al., 2019;
third phase (Classification, Section 3.3) performs three tests: classi-
Tang et al., 2018; Tokime et al., 2018; Tosta, Faria, et al., 2017;
fication provided by each SampEn value; a classification based on
Vahadane & Sethi, 2013; Wienert et al., 2012) were evaluated with
SampEn signatures; and, classification using signature metrics from
metrics commonly explored by studies related to the segmentation pro-
SampEn. The feature vectors were classified using different types of al-
cess of CLL, FL, and MCL groups: accuracy, sensitivity, and specificity.
gorithms. The goal was to identify the combination (dataset, attributes
These metrics were defined considering the concepts of true positive,
and classification approach) able to provide the best performance to
distinguish the NHL images, with the least number of features. The true negative, false positive, and false negative (Oger, Belhomme, &
best combination, in terms of accuracy, was observed under different Gurcan, 2012; Tosta, Neves, & do Nascimento, 2017). Thus, the accu-
conditions, defining the fourth phase (Evaluation, Section 3.4). Each racy value defines the percentage of correctly segmented pixels (nuclear
phase is described in detail in the following sections. structures and background) when compared to manual segmentation
The proposed method was implemented using the MATLAB® R2017a (specialist or gold standard, Tosta, Neves, & do Nascimento, 2017).
and Weka 3.8.1 (Garner et al., 1995) packages. We have used the Sensitivity indicates how many pixels from manual segmentation of
default values suggested by the tools. The experiments were performed nuclear structures were also identified by the computational approach.
on a computer with an Intel® CoreTM i5-3230M, 2.60 GHz, 3rd. Specificity indicates how many pixels defined as background by manual
generation, 8 GB of RAM and Windows operating system with 64-bit segmentation were also identified by the automatic technique (Oger
architecture. et al., 2012; Tosta, Neves, & do Nascimento, 2017).

5
G.B. Rozendo et al. Expert Systems With Applications 202 (2022) 117238

Fig. 1. Overview of the proposed method.

Fig. 2. Examples of the histological images from the lymphomas groups: (a) CLL; (b) FL; and, (c) MCL.

The first method applied was the one proposed by Wienert et al. explored the architecture described by Badrinarayanan, Kendall, and
(2012) which is a contour-based approach to cell segmentation. The Cipolla (2017) for evaluating the histological images. The method con-
approach uses minimal a priori information, identifying contours re- sists of a CNN architecture that performs semantic pixel segmentation.
gardless of shape. The CNN is composed of an encoder network, a decoder network, and
The method proposed by De Oliveira et al. (2013) is a three-step a pixel classification layer. The function of the decoder network is to
segmentation process. In the first step, a contrast elongation technique perform a conversion of the low-resolution feature maps, generated by
is applied on the R channel of the RGB color model. In the second the encoder network, to high-resolution feature maps.
step, global thresholding based on minimum cross-entropy is applied, Finally, we used the Expectation–Maximization method (EM) with
and finally, in the third step, false positives are removed using the the Gaussian Mixture Model (GMM), the EM-GMM model, as a strat-
connected components method. egy to define the nuclei segmentation in H&E images (Silva et al.,
The method proposed by Vahadane and Sethi (2013) is an improve- 2019). The EM-GMM model was based on a proposal described by Ir-
ment on the seeded watershed segmentation method. First, foreground shad, Veillard, Roux, and Racoceanu (2013), Ragothaman, Narasimhan,
seeds are generated through rapid radial symmetry transformation. Basavaraj, and Dewar (2016). Thus, the GMM creates probability den-
After that, the Otsu thresholding method is applied in the histograms sity functions (PDF) to estimate the probability that each pixel belongs
and background markers are calculated. Finally, false true detection to a class. The parameters of these PDF functions are estimated using
and removal are performed using Boolean operators. the EM method, which consists of steps E and M. Step E computes
The method proposed by Tosta, Faria, et al. (2017) consists of the logarithmic probability of the entire dataset. Step M finds the
an unsupervised method of segmentation of nuclear components of parameters by maximizing the logarithmic probability. These steps are
neoplastic cells that combines histogram equalization, Gaussian filter, repeated until they converge, and the mean and covariance matrices
fuzzy 3-partition entropy, genetic algorithm, morphological techniques, obtained in the convergence are chosen as the final parameters in the
and method of vouchers emphasis. model.
A strategy based on CNN was also applied in our investigation,
considering the study performed by Tokime et al. (2018). These authors 3.1.2. Investigation considering color normalization techniques
used the architecture described by Ronneberger, Fischer, and Brox Another application context of the model aimed to define the dis-
(2015) in order to define nuclei segmentation in H&E images. This criminative power of SampEn signatures on histological images after
strategy uses a CNN architecture called UNet that performs semantic applying color normalization techniques. Color normalization methods
segmentation. The UNet is an encoder–decoder network wherein the are developed as preprocessing strategies to assist in correcting color
encoder is a series of convolutional layers that learn to extract low- variations due to the staining process.
resolution features, while the decoder allows precise localization of It is important to highlight that the H&E staining is useful for pattern
high-resolution features. recognition and classification because the stains allow to highlight
Another CNN applied to provide segmented images in our study was cell structures of interest. Hematoxylin stains acid structures in blue–
based on an approach presented by Tang et al. (2018). The authors purple shades, such as nuclei. Eosin stains basic pH components in pink

6
G.B. Rozendo et al. Expert Systems With Applications 202 (2022) 117238

shades, such as cytoplasm areas. The images obtained from this process Besides, histological image pixels represent a stain mixture, which
represent these conditions through colors, or different intensities when justifies the use of the sparsity approach. This estimation was made
observed in grayscale. Thus, the architecture of tissue components by a dictionary learning reaching a global optimum which allowed the
and cellular morphological aspects are highlighted, which are essential stain density map estimation through a robust sparse coding. Before
for tissue texture extraction and cancer diagnosis (Tosta, de Faria, these estimations, the RGB values of both source and target images
Neves, & do Nascimento, 2019; Tosta, Neves, & do Nascimento, 2017). were converted to the optical density space and after those, the nor-
However, the existing variations can also be caused by the non-uniform malized image was defined as performed by Macenko et al. (2009).
distribution of the H&E stains or by bad lighting conditions during the The RGB image was then obtained by application of the inverse of the
acquisition process (Feng et al., 2018; Jothi & Rajam, 2017). This condi- Beer–Lambert law.
tion can influence the interpretation and decision-making of specialists
and computer systems during the process of diagnosing a disease under 3.2. Feature extraction: sample entropy
investigation. For this reason, the color normalization techniques of
H&E images have been increasingly investigated to compose a new SampEn calculations were performed by considering that each im-
phase in the process of developing diagnostic support systems. The age is a matrix 𝑈𝑖𝑗 , and each position within the image was related to
color normalization consists of standardizing the colors of all images a grayscale value (Richman & Moorman, 2000; Silva et al., 2016; Yeh
that are part of a dataset (Jothi & Rajam, 2017). In this context, the et al., 2011). Sub-matrices (windows) 𝑥𝑚 𝑖,𝑗 sized 𝑚 × 𝑚, overlapped the
applied techniques were the models described by Khan et al. (2014), image sized 𝑁𝑈 = max(𝑖) × max(𝑗) with the goal of attaining samples of
Macenko et al. (2009), Reinhard et al. (2001) and, Vahadane et al. possible patterns contained in 𝑈𝑖𝑗 , are given by:
(2016), using the results presented by Tosta, de Faria, Servato, et al.
(2019). 𝑥𝑚
𝑖,𝑗 = {𝑢(𝑖, 𝑗), 𝑢(𝑖, 𝑗 + 1), … , 𝑢(𝑖 + 1, 𝑗), … , 𝑢(𝑖 + 𝑚 − 1, 𝑗 + 𝑚 − 1)}, (1)
The method proposed by Reinhard et al. (2001) is one of the most
applied in the literature. First, the images are converted from the wherein 1 ≤ 𝑖 ≤ max(𝑖)−𝑚+1 and 1 ≤ 𝑗 ≤ max(𝑗)−𝑚+1. To verify existing
RGB space to the 𝑙𝛼𝛽 space due to its minimum correlation among its similarities, each window 𝑥𝑚 𝑖,𝑗 was fixed in position (𝑖, 𝑗) and compared
channels in natural images. This condition allows that color alterations to windows 𝑦𝑚 𝑖′ ,𝑗 ′
, which were positioned throughout the image.
using this model do not lead to artifacts in the resulting image. The The similarities between two patterns were calculated from a dis-
𝑚 between two windows 𝑥𝑚 and 𝑦𝑚 . The distance was defined
tance 𝑑𝑥,𝑦
metrics of mean and standard deviation are calculated for each 𝑙𝛼𝛽 𝑖,𝑗 𝑖′ ,𝑗 ′
channel for the target and source images. The new 𝑙𝛼𝛽 channels of the as the maximum difference among the pixel intensity of the windows:
source image are then computed by all these metrics. In the end, the ′ ′
𝑖,𝑗 , 𝑦𝑖′ ,𝑗 ′ ] = max |𝑢(𝑖 + 𝑘, 𝑗 + 𝑙) − 𝑢(𝑖 + 𝑘, 𝑗 + 𝑙)|,
𝑚
𝑑𝑥,𝑦 = 𝑑[𝑥𝑚 𝑚
(2)
𝑙𝛼𝛽 channels are converted to the RGB model. Although it was proposed
for natural images, the normalization of Reinhard et al. (2001) is wherein 𝑘 > 0, 𝑙 < 𝑚 − 1 and 𝑖, 𝑗 ≠ 𝑖′ , 𝑗 ′ , not including self-comparisons.
commonly used by studies of histological images processing (Tosta, The probability of a window 𝑦𝑚 𝑖′ ,𝑗 ′
being at a maximum distance of 𝑟
de Faria, Neves, & do Nascimento, 2019). from a fixed window 𝑥𝑚 𝑖,𝑗 was defined by determining the average count
Another approach applied was following the method presented of all windows 𝑦𝑚𝑖′ ,𝑗 ′
that were close to 𝑥𝑚 𝑖,𝑗 , according to Eq. (3).
by Macenko et al. (2009). In this approach, the RGB colors in images
are converted to the optical density space. This space allows linear 1 ∑ max(𝑗)−𝑚
max(𝑖)−𝑚 ∑
𝐶 𝑚 (𝑥𝑚
𝑖,𝑗 ) = 𝜃(𝑑[𝑥𝑚 𝑚
𝑖,𝑗 , 𝑦𝑖′ ,𝑗 ′ ] − 𝑟), (3)
combination of the stains to be also linearly represented. In this repre- 𝐷−1 𝑖′ =1 𝑗 ′ =1
sentation, each RGB channel is converted to its corresponding optical
densities by calculating the logarithm of its normalized value. This with (𝑖, 𝑗) ≠ (𝑖′ , 𝑗 ′ ), 𝐷 = (max(𝑖) − 𝑚) × (max(𝑗) − 𝑚) and 𝜃(𝑑[𝑥𝑚 𝑚
𝑖,𝑗 , 𝑦𝑖′ ,𝑗 ′ ] − 𝑟)
processing allows the estimation of a stain color appearance matrix and the Heaviside function defined by:
{
a stain density map. The stain color appearance matrix specifies the rep- 1, if 𝑑𝑥𝑦 𝑚 − 𝑟 ≤ 0,
𝑚
resentation of each stain by the RGB channels and the stain density map 𝜃(𝑑𝑥𝑦 − 𝑟) = (4)
0, if 𝑑𝑥𝑦 𝑚 − 𝑟 > 0,
quantifies the saturation of each stain in each pixel (Tosta, de Faria,
Neves, & do Nascimento, 2019). To estimate these components, optical wherein 𝑟 is a tolerance parameter that defines whether two patterns
densities were used to estimate the color appearance matrix by the are considered similar. Two patterns are considered similar if each
singular value decomposition method. The density map could be then corresponding pixel within the patterns is distant at most 𝑟 from
calculated using the optical densities and the matrix coefficients. These each other (Silva et al., 2016). Thus, the value of 𝑟 is determined by
estimates were performed for the target and source images, obtaining evaluating the standard deviation of the analyzed image, given by:
a normalized image through combination between the target image
matrix and the source image map. 𝑟 = 𝛽 × 𝜎(𝑈 ), (5)
The stain color appearance matrix was also used by Khan et al. in which 𝛽 is the real positive constant and 𝜎 is the standard deviation
(2014). This technique estimates the color appearance matrix by a of image 𝑈 .
classification of pixels into hematoxylin, eosin, and background using SampEn was calculated using the negative of natural logarithm of
an oct-tree quantization, RGB values, and the relevance vector machine the ratio between similarity coefficients of all analyzed patterns of size
classifier. This processing was performed for both target and source (𝑚 + 1) × (𝑚 + 1) with similarity coefficients of patterns 𝑚 × 𝑚:
images, which were deconvolved for further nonlinear mapping be-
tween them. To do so, the metrics of mean, 5th percentile, and 95th 𝛷𝑚+1
𝑆𝑎𝑚𝑝𝐸𝑛(𝑚, 𝑟, 𝑁𝑈 ) = − ln , (6)
percentile were calculated for each channel of both images and used for 𝛷𝑚
mapping these metrics of the source image to the target image through wherein 𝛷𝑚+1 and 𝛷𝑚 are the similarity coefficients defined respec-
the B-spline method. The normalized image was obtained by the target tively by:
color appearance matrix combined with the stain channels of the source
1 ∑
max(𝑖)−(𝑚+1) ∑
max(𝑗)−(𝑚+1)
image. 𝛷𝑚+1 = 𝐶 𝑚+1 (𝑥𝑚+1 (7)
𝐷 𝑖,𝑗 ),
Vahadane et al. (2016) presented an estimation of the stain color 𝑖=1 𝑗=1
appearance matrix by a sparse non-negative matrix factorization. This and
method was chosen since the matrix coefficients cannot have negative
values, which would be biologically incorrect because this condition 1 ∑ max(𝑗)−𝑚
max(𝑖)−𝑚 ∑
𝛷𝑚 = 𝐶 𝑚 (𝑥𝑚
𝑖,𝑗 ). (8)
represents a light-emitting that is not possible in the tissue staining. 𝐷 𝑖=1 𝑗=1

7
G.B. Rozendo et al. Expert Systems With Applications 202 (2022) 117238

Table 1
Average standard deviation values obtained from different combinations of 𝑣 sub-images sized 𝑛 × 𝑛.
𝑛 𝑣
20 40 60 80 100 120 140 160 180
30 0.0680 0.0489 0.0421 0.0362 0.0335 0.0282 0.0272 0.0252 0.0243
40 0.0597 0.0423 0.0341 0.0295 0.0286 0.0253 0.0222 0.0222 0.0198

In this paper, SampEn was applied in order to use sub-images to increments of 0.02. The chosen values (𝑚 and 𝑟) were widely discussed
enable the quantification of images. This step is necessary to determine within important works available in the literature, such as by da Silva
SampEn values, since the quantity of comparisons among pixels was on et al. (2014), Silva et al. (2016), and Richman and Moorman (2000).
the order of 2×(max(𝑖)−𝑚+1)2 ×(max(𝑗)−𝑚+1)2 (Yeh et al., 2011). The A signature of 𝑚 = 1 and 𝑟 = 0.06 provided the quantification of
selected procedure considered 𝑣 sub-images of size 𝑛 × 𝑛, which were minor patterns contained within an image, and indicated increases in
randomly selected throughout different positions of the input image
restrictive tolerance. On the other hand, a signature obtained using
𝑈 , following a uniform distribution model. After calculating values for
𝑚 = 4 and 𝑟 = 0.04 facilitated the quantification of major patterns, and
each sub-image, a SampEn value for the whole image was obtained
indicated a decreasing restrictive tolerance. Quantifications obtained
using the simple arithmetic average of the entropy values of all chosen
using SampEn were organized as curves (texture signatures) in the
sub-images (𝑣).
coordinate system SampEn (Y axis) × 𝑟 (X axis). Signatures were applied
𝑆𝑎𝑚𝑝𝐸𝑛1 (𝑚, 𝑟, 𝑛) + ⋯ + 𝑆𝑎𝑚𝑝𝐸𝑛𝑘 (𝑚, 𝑟, 𝑛)
𝑆𝑎𝑚𝑝𝐸𝑛𝑈 (𝑚, 𝑟, 𝑁𝑖𝑚𝑔 ) = , (9) to quantify texture complexity for different values of 𝑚, and provided
𝑣
a more complete and accurate description of the studied images. For
in which 𝑆𝑎𝑚𝑝𝐸𝑛𝑈 represents the entropy of the image and 𝑆𝑎𝑚𝑝𝐸𝑛1 , each image, four texture signatures were obtained, one for each value
⋯, 𝑆𝑎𝑚𝑝𝐸𝑛𝑘 are entropies of the sub-images 1 to 𝑘, respectively.
of 𝑚.
The values of 𝑛 and 𝑣 used for quantifying the NHL images were
defined considering a test that was based on standard deviation val-
ues (Yeh et al., 2011). The test involved calculating SampEn repeatedly 3.2.2. Signature metrics
for the images included within the lymphoma dataset, and considering
Each SampEn signature was quantitatively evaluated from met-
different values for parameters 𝑛 and 𝑣. Also, the standard deviation
rics (Căliman & Ivanovici, 2012), with the goal of quantifying the
of SampEn values were calculated. The acceptable values of 𝑛 and 𝑣
behaviors of each NHL group as a function of multiple observations
were calculated from tests performed on an image subset from National
Cancer Institute (2022) and National Institute on Aging (2022). This of 𝑚. The applied metrics included area under curve (𝐴), skewness (𝑆),
subset was comprised of approximately 10% of the available images maximum point of entropy (𝑃𝑚𝑎𝑥 ) and area ratio (𝑅).
of each group. The images were randomly chosen. Each combination The metric 𝐴 was defined as the existing area between a function,
(𝑛, 𝑣) was tested 10 times per image, without window overlapping, 𝑓 (𝑥), and axis 𝑥, on an interval [𝑎, 𝑏]. A value of 𝐴 was obtained
which made it possible to obtain the average value of SampEn and the by calculating the integral of 𝑓 (𝑥) for range [𝑎, 𝑏]. However, since
standard deviation of the chosen image. This process was applied to all SampEn is a discrete function, the value of 𝐴 was computed from a
images within the subset. Thus, the combination (𝑛, 𝑣) that resulted in trapezoid-based approach (Mathworks, 2022b):
standard deviation value below 0.02 was selected for quantifying all
𝑁𝑒
images of the lymphoma dataset. Sufficiently small standard deviation 1∑
𝐴≈ (𝐸 − 𝐸𝑖 )[𝑓 (𝐸𝑖 ) + 𝑓 (𝐸𝑖+1 )], (10)
value indicates that SampEn value was acceptable for quantifying 2 𝑖=1 𝑖+1
the image. This type of analysis and standard deviation thresholding
procedure were previously defined by Yeh et al. (2011). Examples of wherein (𝐸𝑖+1 − 𝐸𝑖 ) indicates the spacing between two consecutive
combinations used are presented on Table 1, which includes average points, 𝑁𝑒 is the number of entropy values of the curve and 𝑓 (𝐸𝑖 ) is
standard deviation values for each pair (𝑛, 𝑣). The best combination the entropy value of point 𝐸𝑖 .
with the imposed requirement was obtained with 𝑛 = 40 and 𝑣 = 180, The metric 𝑆 indicated curve asymmetry relative to its average
and provided an average standard deviation of 0.0198. These values value. If the values of the curve are more concentrated to the right
were selected for the study of lymphoma images. of the curve average, skewness is positive. When values of the curve
are more concentrated to the left, skewness is negative. Skewness was
3.2.1. SampEn signatures calculated according to Eq. (11) (Mathworks, 2022a):
A texture signature is useful for quantifying the spatial arrange- ∑ 𝑁𝑒
1
ment and intensities of pixels in a digital image. In our proposal, 𝑖=1
𝑁𝑒
(𝑓 (𝐸𝑖 ) − 𝑓 (𝐸))3
this was made possible by computing SampEn values from multiple 𝑆= √ , (11)
∑ 𝑁𝑒
observations, which involved quantifications using different combina- ( 𝑁1 𝑖=1
(𝑓 (𝐸 𝑖 ) − 𝑓 (𝐸))2 )3
𝑒
tions of values for the parameters 𝑚 and 𝑟. The parameter 𝑚 was
employed to determine pixel window sizes in order to assess patterns wherein 𝑓 (𝐸𝑖 ) is the 𝑖th entropy value, 𝑓 (𝐸) is the curve average and
within an image. The parameter 𝑚 was derived from the set N∗ and 𝑁𝑒 is the number of points in the curve.
represented the observation dimension. It is important to note that
The metric 𝑃𝑚𝑎𝑥 made it possible to acquire the highest degree of
the greater the 𝑚 value, the greater the possibility of no pattern
entropy for each signature obtained for each function of 𝑚.
matches (Humeau-Heurtier, 2019). Thus, the lack of similarities be-
tween windows produces division by zero in Eq. (6). In this paper, The metric 𝑅 was determined from the average value of each
the quantifications were performed using 𝑚 values ranging from 1 to analyzed signature. A 𝑅 value was calculated as the ratio between areas
4, at intervals of one unit, in order to avoid this problem. Also, the on the left and right sides relative to the center of the curve, according
parameter 𝑟 was applied to indicate the tolerance level so that two to the following Equation:
windows sized 𝑚 are defined as similar. Similarity was defined as the 𝐴(𝑁𝑒 ∕2+1,𝑁𝑒 )
𝑚 between two windows (𝑥𝑚 and 𝑦𝑚 ) according to Eq. (2).
distance 𝑑𝑥,𝑦 𝑅= , (12)
𝑖,𝑗 𝑖′ ,𝑗 ′ 𝐴(1,𝑁𝑒 ∕2)
The parameter 𝑟 was a real number greater than zero. Therefore, the
signatures were obtained for values of 𝑟 between 0.06 and 0.4, at in which 𝑁𝑒 indicates the number of points in the curve.

8
G.B. Rozendo et al. Expert Systems With Applications 202 (2022) 117238

The SVM algorithm is a deterministic supervised classification


method, wherein each instance is perceived as a point in a space with
a dimension that is equal to the number of extracted features. This
technique allows researchers to define a hyperplane that provides an
optimal degree of point separation between distinct classes, which have
the greatest possible distance between the hyperplane and its nearest
points. There are different approaches for defining the hyperplane,
which are known as kernel functions (Platt, 1999). The used parameters
were 𝐶 = 1, 𝜖 = 10−12 and polynomial kernel.
K* associates a class to an instance within a test set from classes of
the training set using similarity. For each test instance, a comparison
between the whole training set is performed, while considering entropy
Fig. 3. Illustration of the feature vector structure considering the metrics 𝐴, 𝑆, 𝑃𝑚𝑎𝑥 as a criterion. The class that provides the highest complexity is defined
and 𝑅 for each signature (𝑚 ranging from 1 to 4). as the class of the new instance (Cleary & Trigg, 1995). The value of
20% was used as the global bend parameter.
The random forest (RF) algorithm combines several decision trees in
3.2.3. Compositions based on SampEn features order to perform class prediction (Breiman, 2001). A decision tree is a
The SampEn values were obtained from the three datasets (original statistical algorithm composed of nodes connected by edges wherein
internal nodes represent decisions derived from a rule and leaves
histological images and those resulting from the multiple segmentation
represent the possible classifications. A tree is built for each class of the
and color normalization techniques). Thus, the values were used by
training set. Each instance of the set is evaluated in relation to the built
exploring different strategies. The first one considered all the SampEn
trees, using a process based on voting. The class that obtains the most
values, which were calculated for each combination of 𝑚 and 𝑟. This votes is specified as the class of the studied instance. The parameters
strategy explored the features defined as SampEn signature. The result used were maximum depth equal to 0, size of each bag equal to 100%,
is a set with 72 SampEn values, that were produced using 𝑚 values 100 iterations and minimum number of instances in the leaves equal
that ranged from 1 to 4 and 𝑟 values that ranged from 0.06 to 0.40. to 1.
The second strategy aimed to identify the discriminative power of each The genetic algorithm (GA) uses an optimal or sub-optimal solution,
SampEn value (analyzed individually). This type of test is the most which is sought using techniques inspired by the process of natural
frequently described in the literature (da Silva et al., 2014; Richman selection in evolutionary biology (Pei, Goodman, Punch, & Ding, 1995).
& Moorman, 2000; Silva et al., 2016). Here, this type of test was Natural selection consists in the evolution of generations of individuals,
identified as classification provided by each SampEn value. Finally, and is designed to eliminate weak individuals and retain the ones con-
the signature metrics (𝐴, 𝑆, 𝑃𝑚𝑎𝑥 and 𝑅) were calculated for each sidered strong. In the algorithm, possible solutions contain subgroups
SampEn signature, totaling 16 attributes, as illustrated in Fig. 3. This of features called chromosomes. For each generation, chromosomes are
last strategy represents another important contribution of this work subjected to the processes of crossover and mutation. The best chromo-
considering the quantitative power of SampEn in the context of NHL somes are passed forward the next generation. The KNN algorithm was
used to evaluate the classification power of each generation. This cycle
images.
was repeated until the convergence toward an optimal solution was
achieved. The parameters used a crossover probability equal to 60%,
3.3. Classification maximum number of generations equal to 20, mutation probability
equal to 3.3% and population size of 20.
The classifications were performed through different strategies, such Particle swarm optimization (PSO) is based on bird flocking and fish
as support vector machine - SVM (function-based), K* (lazy learning) schooling behaviors. The process provides means to search an optimal
and random forest (decision tree-based). Also, evolutionary approaches solution via information exchange between individuals of a popula-
were applied in order to define the most relevant features in each tested tion (Kennedy, 2011). In the algorithm, each solution is considered to
combination, with a full understanding of the solutions to distinguish be a particle with characteristics that define its direction of movement.
the NHL groups. The chosen strategies were genetic algorithm (genetic These characteristics include its current position, objective function
evolution-based), particle swarm optimization (particle swarm-based) and speed vector. The objective function is applied to determine the
particle with the highest discriminative power in the current iteration,
and binary gray wolf optimization (animal behavior-based), all using
as evaluated by the KNN classifier. Through information exchange,
the K-nearest neighbor (KNN) classifier. KNN is a commonly used
the other particles follow the movements of the best particles. This
method for fitness function due to its simplicity, fast processing speed
behavior is repeated until the optimal solution is found. The used
and high classification performance (Emary, Zawbaa, & Hassanien,
parameters were individual weights equal to 0.34, inertia equal to
2016; Hans & Kaur, 2022; Li, Xue, & Zhang, 2021; Maleki, Zeinali, & 0.33, mutation probability of 1%, population size of 20 individuals, 20
Niaki, 2021; Too & Abdullah, 2021). iterations and social weight of 0.33.
Each classification strategy was evaluated using the weka v3.8.1 Binary gray wolf optimization (bGWO) is an optimization method
package (Garner et al., 1995) and the corresponding algorithms were that is based on the hunting strategy of gray wolves (Emary et al.,
SMO (Sequential Minimal Optimization), KStar, RandomForest, Genet- 2016). The hunting tactic of a pack consists of following, getting
icSearch, PSOSearch and WolfSearch. The performance metric used to closer, chasing and surrounding the prey until it stops moving. In the
evaluate the tests was accuracy (𝐴𝑐𝑐), according to Eq. (13). Also, mathematical model, the wolves are candidate solutions of subgroup
since some of the NHL groups as well as the different types of images features. Initially, the positions of wolves were randomly set. The best
explored have a small number of samples, we chose to apply 3-fold candidates were selected according to fitness functions, which were
cross-validation in all testing stages in order to avoid problems such as evaluated according to the classification power of the solutions within
overfitting. the current iteration, which was tested using the KNN classifier. For
each iteration, positions of wolves were updated until the best solution
𝑇𝑃 + 𝑇𝑁 was found (Emary et al., 2016; Mirjalili, Mirjalili, & Lewis, 2014). The
𝐴𝑐𝑐 = , (13)
𝑁𝐶 parameters used for the tests were a population of 20 individuals, 20
wherein 𝑇 𝑃 is the number of cases classified as true positives, 𝑇 𝑁 is iterations, and absorption coefficient equal to 0.001, escape probability
the number of cases classified as true negatives and 𝑁𝐶 is the total of 80%, chaotic coefficient equal to 4, mutation probability of 1% and
number of cases. minimum beta of 0.33.

9
G.B. Rozendo et al. Expert Systems With Applications 202 (2022) 117238

Fig. 4. Examples of segmented images obtained from a CLL sample (a), (Shamir et al., 2008). The images represent the segmentation results performed by a specialist (b) and
after applying the techniques described by De Oliveira et al. (2013) (c), Vahadane and Sethi (2013) (d), Wienert et al. (2012) (e), Tosta, Faria, et al. (2017) (f), Tokime et al.
(2018) (g), Tang et al. (2018) (h), and Silva et al. (2019) (i).

3.4. Evaluation pixel corruption was applied by randomly selecting pixels from the
image and replacing them with values following a uniform distribution
The discriminative capacity of the best combination was observed in over zero and the largest image pixel value. The white Gaussian noise
different situations in order to verify the contributions and limitations. was also added by randomly selecting pixels but replacing them with
Details are presented in the next subsections. its values plus a value following the standard normal distribution with
zero of mean and 1 of standard deviation. Each test was evaluated after
3.4.1. Applying different levels of noise adding noise in relation to the total pixels of the image.
The best combination of SampEn features and classifier was ana-
lyzed by intentionally applying different types of noise, as described
3.4.2. Indications of comparisons with previous works and distinct texture
by Feng et al. (2018) and Zhu and Wu (2004). These tests are important
descriptors
for determining the capacity of the model to classify CLL, FL and
All the tests explored here are important to indicate the best combi-
MCL groups under variable conditions. Thus, each test was evaluated
nation of techniques for the classification of NHL groups. In addition,
considering the accuracy metric and noise levels of 20%, 40% and,
we compared our best result to the performances provided by other
80%, as described by Martins et al. (2017), Xiao, He, Jiang, and Liu
techniques that explored the NHL dataset. The techniques were pre-
(2010) and, Zhu and Wu (2004).
sented by Bai et al. (2019), Brancati et al. (2019), Candelero et al.
Considering the model presented by Zhu and Wu (2004), the best
(2020), Codella et al. (2016), do Nascimento et al. (2018), Janowczyk
combination of features was analyzed using the training and test sets
(𝐶𝑡𝑟𝑎𝑖𝑛 and 𝐶𝑡𝑒𝑠𝑡 ) that were defined using the 𝑘-fold cross-validation and Madabhushi (2016), Jiang et al. (2018), Martins et al. (2019),
technique, with a value of 𝑘 = 3. The clean training and test data Meng et al. (2010), Orlov et al. (2010), Ribeiro et al. (2018), Roberto
were used to compose their corresponding sets with noise (dirty). The et al. (2021, 2017), Song et al. (2012), Taino et al. (2020), Tambe
𝐷𝑡𝑟𝑎𝑖𝑛 and 𝐷𝑡𝑒𝑠𝑡 files were generated by applying different levels of noise et al. (2019), and Zhang et al. (2021), all exploring different combina-
over the 𝐶𝑡𝑟𝑎𝑖𝑛 and 𝐶𝑡𝑒𝑠𝑡 data. Features affected by noise were randomly tions from that applied in our proposal. In general, these approaches
selected. Therefore, a new noisy training set 𝐷𝑡𝑟𝑎𝑖𝑛 was obtained from were based on Fourier and wavelets transforms, Gabor filter, local
𝐶𝑡𝑟𝑎𝑖𝑛 and applied to distinguish instances of the test set 𝐶𝑡𝑒𝑠𝑡 : 𝐷𝑡𝑟𝑎𝑖𝑛 × binary pattern, morphological features, fractal approaches, or convo-
𝐶𝑡𝑒𝑠𝑡 . The same percentage of noise was included in the test set to lution kernels, as well as different classifiers (support vector machine,
generate the noisy test 𝐷𝑡𝑒𝑠𝑡 . After this procedure, the comparisons also random forest, DECORATE, and Polynomial), or deep learning-based
involved 𝐶𝑡𝑟𝑎𝑖𝑛 ×𝐷𝑡𝑒𝑠𝑡 and 𝐷𝑡𝑟𝑎𝑖𝑛 ×𝐷𝑡𝑒𝑠𝑡 , which represented clean training methods. Finally, comparisons with different texture descriptors were
feature sets against noisy test feature sets, as well as noisy training also performed, especially considering the techniques based on mul-
feature sets against noisy test feature sets. tidimensional and multiscale quantifications, such as fractal dimen-
Finally, we also tested the best combination of SampEn features and sional (Ivanovici & Richard, 2011), lacunarity (Căliman & Ivanovici,
classifiers under image noise conditions. We applied well-known noises 2012; Ivanovici, Richard, & Decean, 2009) and percolation (Roberto
to the field of histopathological imaging: random pixel corruption et al., 2021, 2017). These methods were applied to analyze the different
(RPC) and white Gaussian noise (WGN) (Feng et al., 2018). The random NHL images (original, segmented and normalized).

10
G.B. Rozendo et al. Expert Systems With Applications 202 (2022) 117238

Fig. 5. Examples of images obtained after applying color normalization techniques to a CLL sample (Fig. 4(a)). The results were defined from the proposals of Reinhard et al.
(2001) (a), Macenko et al. (2009) (b), Khan et al. (2014) (c), and Vahadane et al. (2016) (d).

Fig. 6. Accuracy values: based on total pixels segmented correctly (nuclear structures and background) by computational methods when compared to manual segmentation
(specialist, Tosta, Faria, et al., 2017), considering CLL (a), FL (b) and MCL (c) groups.

4. Results according to the results presented by Tosta, de Faria, Servato, et al.


(2019). Fig. 5 shows normalized images obtained from a histological
The best combination of techniques was determined after analyzing sample of the CLL group (Fig. 4(a)).
the original NHL images and the achieved results with multiple segmen- It is important to highlight that segmentation results can be eval-
tation and color normalization techniques. Thus, the results were based uated through different metrics. Thus, in relation to the segmentation
on images segmented by a specialist (gold standard) (Tosta, Faria, quality achieved with each technique in NHL images, an overview is
et al., 2017) and computational techniques (De Oliveira et al., 2013; illustrated in Figs. 6–8, considering box plots of accuracy, sensitivity,
Silva et al., 2019; Tang et al., 2018; Tokime et al., 2018; Vahadane and specificity, respectively, for all kinds of investigated NHL types.
& Sethi, 2013; Wienert et al., 2012). Fig. 4 shows the segmented These metrics were described in Section 3.1.1. Regarding the normal-
images obtained from a sample of the CLL group (Fig. 4(a)). Also, ization process for NHL images, the quality achieved in each technique
color normalization techniques were applied to assist in correcting was presented in detail by Tosta, de Faria, Servato, et al. (2019).
color variations due to the staining process (Jothi & Rajam, 2017). The Considering the segmentation results, Fig. 6 illustrates that the
explored models were the proposed by Khan et al. (2014), Macenko best median accuracies were reached by the techniques of Tokime
et al. (2009), Reinhard et al. (2001), and Vahadane et al. (2016), et al. (2018) and Tang et al. (2018) in the images of all NHL groups,

11
G.B. Rozendo et al. Expert Systems With Applications 202 (2022) 117238

Table 2
Mean values of SampEn for the groups of lymphomas CLL, FL, and MCL.
𝑚=1 𝑚=2 𝑚=3 𝑚=4
𝑟 = 0.06 ⋯ 𝑟 = 0.12 𝑟 = 0.14 ⋯ 𝑟 = 0.20 𝑟 = 0.22 ⋯ 𝑟 = 0.32 ⋯ 𝑟 = 0.28 ⋯ 𝑟 = 0.32 ⋯ 𝑟 = 0.40
CLL 5.9781 ⋯ 4.3801 3.9192 ⋯ 3.3062 3.0523 ⋯ 2.2824 ⋯ 2.4784 ⋯ 2.2972 ⋯ 2.0000
FL 5.1097 ⋯ 3.2908 2.9588 ⋯ 2.4218 2.2832 ⋯ 1.7130 ⋯ 1.9555 ⋯ 1.7968 ⋯ 1.5294
MCL 5.8143 ⋯ 4.1413 3.7120 ⋯ 3.0904 2.8915 ⋯ 2.1704 ⋯ 2.3483 ⋯ 2.1922 ⋯ 1.9043

Fig. 7. Sensitivity values: based on total pixels defined as nuclear structures by computational methods when compared to manual segmentation (specialist, Tosta, Faria, et al.,
2017), considering the CLL (a), FL (b) and MCL (c) images.

with values beyond 80%. This demonstrates that, among the evaluated by the techniques of Tokime et al. (2018) and Tang et al. (2018) in all
techniques, those based on CNN were capable of identifying in a higher classes. However, the sensitivity results obtained by these techniques
proportion nuclear and background regions as such, in comparison with indicate an under-segmentation, with low true positive rates (nuclei)
the specialist segmentation. These techniques also reached the highest and high true negative rates (background). The segmentation meth-
minimum values in all NHL classes, with the exception of some outliers ods of Wienert et al. (2012) and Tosta, Faria, et al. (2017) reached
in the MCL class (Fig. 6(c)). intermediate results in all NHL classes and all metrics.
The sensitivity results illustrated in Fig. 7 show that the tech-
niques of De Oliveira et al. (2013) and Silva et al. (2019) reached 4.1. SampEn values applied to distinguish NHL groups
the highest median values in all classes, with results higher than
60%. This condition demonstrates that nuclear regions segmented by The proposed method was applied to quantify histological NHL sam-
the specialist were also identified by these techniques. In contrast, ples (original, segmented, and normalized) and the results consisted of
these methods obtained the lowest median values of specificity in all SampEn values, according to the (𝑚, 𝑟) parameters, for each image given
NHL groups, as illustrated by Fig. 8. This condition indicates that as input. For instance, Table 2 displays the averages of the SampEn
these segmentation approaches did not identify the background like values that were calculated for the original images from the CLL, FL,
the specialist. Through the joint analysis of sensitivity and specificity and MCL groups. Also, to illustrate the statistical significance of these
results, it is possible to note that these techniques obtained a high features, the Wilcoxon–Mann–Whitney test was applied to the SampEn
true positive rate (nuclei) and a low true negative rate (background), values that represent the different NHL groups. The significance level
which characterizes an over-segmentation, with nuclear areas bigger was of 0.05. The results were mean 𝑝-values of 0.0036 (CLL versus
than the gold standard. The highest specificity values were obtained FL), 0.1458 (CLL versus MCL), and 0.0064 (FL versus MCL). These

12
G.B. Rozendo et al. Expert Systems With Applications 202 (2022) 117238

Fig. 8. Specificity values: based on total pixels defined as background by computational methods when compared to manual segmentation (specialist, Tosta, Faria, et al., 2017),
considering the CLL (a), FL (b) and MCL (c) images.

results are useful to show that the SampEn values present statistically are shown on Table 4, considering the different types of images and
relevant differences for the NHL groups, mainly considering CLL × FL NHL groups. The highest 𝐴𝑐𝑐 value for each investigated comparison is
and FL × MCL. Thus, the SampEn values can be analyzed using different highlighted in bold.
classifiers, according to the proposed methodology. Considering the results shown on Tables 3 and 4, the highest 𝐴𝑐𝑐
From the values obtained with SampEn, the signatures were con- rates were achieved from images segmented by a specialist, whether
structed for different observations of 𝑚. For example, Fig. 9 displays involving the signature vectors or the classification provided by each
the SampEn signatures obtained from the images indicated in Fig. 2. SampEn value, even when the images were segmented with CNN
Notably, signatures of an image are different for each observation of models (Tang et al., 2018; Tokime et al., 2018). Also, taking as a main
𝑚. Moreover, considering the observation 𝑚 = 1, the SampEn produced goal the distinction among the three classes, CLL × FL × MCL, the best
values that were initially at the same tolerance level of 𝑟 = 0.06, result was 87.86% of 𝐴𝑐𝑐, exploring the strategy of SampEn signatures
regardless of group (Fig. 9(a)). This did not occur for other observations with the bGWO method. This strategy indicates that the use of SampEn
of 𝑚, in which SampEn did not identify similarities among the ana- signatures contributes to the process of classifying the NHL groups,
lyzed windows and, consequently, each class presented SampEn values especially in comparison with the results provided through a strategy
from distinct tolerance levels 𝑟. These characteristics are illustrated in commonly explored in the specialized literature on the topic (da Silva
Figs. 9(b) (𝑚 = 2), 9(c) (𝑚 = 3) and 9(d) (𝑚 = 4). This behavior has et al., 2014; Richman & Moorman, 2000; Silva et al., 2016). For
been properly discussed by Richman and Moorman (2000) and Azami, example, when results via the classification of each SampEn value are
Escudero, and Humeau-Heurtier (2017). Thus, the features obtained considered (Table 4), the highest 𝐴𝑐𝑐 value was of 72.25% with the
can indicate that distinct patterns were quantified in the signatures of K*, GA, PSO, and bGWO methods. Finally, the classifications involving
each NHL class. comparisons of CLL × FL × MCL with original or normalized images
Tests were defined to show the superiority of texture signatures provided less expressive results than those previously highlighted.
for quantifying and classifying NHL groups, according to the strategies
presented in Section 3.2.3. The compared groups were CLL × FL, CLL 4.2. Analysis of the NHL groups via metrics obtained from the SampEn
× MCL, FL × MCL and CLL × FL × MCL. Table 3 shows the best results signatures
obtained from the SampEn signatures. The highest 𝐴𝑐𝑐 value for each
investigated situation is highlighted in bold. The second strategy was Another way to verify the discriminative power of SampEn signa-
applied to analyze the discriminative power of each SampEn value tures was through metrics extracted from the curves. Thus, the metrics
(analyzed individually). The best results obtained with this strategy were calculated from the SampEn signatures (Section 3.2.1) and the

13
G.B. Rozendo et al. Expert Systems With Applications 202 (2022) 117238

Fig. 9. Examples of SampEn signatures for the histological images shown in Fig. 2. The patterns were calculated with 𝑟 ranging from 0.06 to 0.4 for the observations 𝑚 = 1 (a),
𝑚 = 2 (b), 𝑚 = 3 (c), and 𝑚 = 4 (d).

results were provided as inputs for distinct classifiers (Section 3.3). (FL × MCL) and 88.43% (CLL × FL × MCL). In addition, considering
The performance provided via signature metrics from SampEn are the results achieved from the segmentation techniques described by De
shown on Table 5, with indications in bold of the best rates (𝐴𝑐𝑐) Oliveira et al. (2013), Silva et al. (2019), Tang et al. (2018), Tokime
for each situation investigated. The classifications were performed et al. (2018), Tosta, Faria, et al. (2017), Vahadane and Sethi (2013),
considering the original histological images and the corresponding and Wienert et al. (2012), the best situation to distinguish the CLL ×
results obtained after applying the multiple segmentation and color FL × MCL groups was through the proposal presented by De Oliveira
normalization methods. et al. (2013), with 81.50% of 𝐴𝑐𝑐 rate.
From Table 5, it is possible to observe some highlights regarding Regarding the classifications obtained via normalized images, other
the use of metrics obtained from the SampEn signatures. For instance, highlights can also be verified. The quantification of this type of image
these results are relevant compared to those previously achieved, with through signature metrics indicates less expressive rates, with 𝐴𝑐𝑐
emphasis on the classifications using original histological images (non- values of 82.53% (CLL × FL), 79.69% (FL × MCL), and 63.10% (CLL ×
segmented and non-normalized). This is an important contribution FL × MCL), considering the model presented by Reinhard et al. (2001).
regarding how SampEn can be better explored for the process of quan- In the comparison of CLL × MCL, the best result (𝐴𝑐𝑐 rate of 78.57%)
tification and recognition of complex textures, such as NHL images. In was obtained with normalized images after applying the technique
this combination, the 𝐴𝑐𝑐 rates were greater than 98% in all compar- described by Macenko et al. (2009).
isons. The best performance occurred in the CLL × FL classification, In this context, the best association was defined via signature met-
with an 𝐴𝑐𝑐 rate of 99.60%. Thus, the quantifications using the signa- rics from the original histological images with the bGWO approach.
ture metrics from SampEn were able to distinguish different patterns This combination was able to distinguish the three NHL groups (CLL
of NHL, even when different classifiers were applied. However, the × FL × MCL) with an 𝐴𝑐𝑐 rate of 98.84%. Moreover, this combination
signature metrics from the segmented or normalized images indicated revealed the most used metrics in this comparison. Thus, a summary of
classifications with less expressive performances. For these datasets, the best association is shown in Table 6. It is noticed that the metrics 𝐴
the obtained results illustrate a possible limitation of this approach. and 𝑅 were selected in most observations 𝑚, followed by the skewness
A possible explanation is that the patterns defined by the spatial and 𝑆 and maximum point of entropy 𝑃𝑚𝑎𝑥 , obtained from 𝑚 equal to 2
textural arrangements (colors) were degraded from the segmentation or and 3. We believe that this is another important contribution of our
normalization processes, making it difficult to identify similar windows proposal, especially for expanding the quantitative capacity of SampEn,
during the calculation of the SampEn and, consequently, the texture in addition to providing and supporting new interpretations and asso-
patterns in the images. Even so, for segmented images, the best results ciations for this descriptor. In this regard, it is observed that important
were defined by a specialist (gold standard) (Tosta, Faria, et al., 2017), approaches explore some challenges of the SampEn for quantifying and
with 𝐴𝑐𝑐 rates of 98.64% (CLL × FL), 97.29% (CLL × MCL), 90.06% classifying patterns, such as properly defining the parameters 𝑚 and

14
G.B. Rozendo et al. Expert Systems With Applications 202 (2022) 117238

Table 3
Highest 𝐴𝑐𝑐 rates achieved through SampEn signatures.
Database CLL × FL CLL × MCL FL × MCL CLL × FL × MCL
84.37% 70.21% 74.71% 59.52%
Original
bGWO bGWO GA and PSO bGWO

97.29% 95.49% 93.16% 87.86%


Segmented by specialist
PSO and bGWO SVM, K*, GA, PSO and bGWO bGWO bGWO

97.29% 93.69% 88.19% 85.54%


Segmented by De Oliveira et al. (2013)
bGWO bGWO bGWO bGWO

90.54% 93.63% 78.75% 68.60%


Segmented by Vahadane and Sethi (2013)
bGWO bGWO bGWO bGWO

93.24% 93.69% 89.44% 83.23%


Segmented by Wienert et al. (2012)
SVM, GA and PSO bGWO bGWO bGWO

95.94% 95.49% 87.57% 80.34%


Segmented by Tosta, Faria, et al. (2017)
PSO and bGWO bGWO PSO bGWO

91.89% 88.89% 83.54% 70.00%


Segmented by Tokime et al. (2018)
bGWO SVM and RF bGWO bGWO

83.78% 89.19% 78.88% 69.94%


Segmented by Tang et al. (2018)
SVM SVM bGWO bGWO

85.16% 89.19% 81.37% 73.41%


Segmented by Silva et al. (2019)
bGWO SVM bGWO bGWO

83.33% 71.91% 82.75% 64.97%


Color normalization (Reinhard et al., 2001)
bGWO bGWO bGWO bGWO

79.76% 67.65% 75.47% 58.55%


Color normalization (Macenko et al., 2009)
bGWO PSO and bGWO PSO RF

77.04% 65.92% 90.85% 53.29%


Color normalization (Khan et al., 2014)
bGWO bGWO bGWO PSO

75.21% 68.86% 74.60% 58.68%


Color normalization (Vahadane et al., 2016)
bGWO PSO RF RF

𝑟 (da Silva Senra Filho et al., 2019; dos Santos et al., 2018; Humeau- 5.1. Analysis of the best association under the influence of noise
Heurtier, 2019; Richman & Moorman, 2000; Silva et al., 2018, 2016;
Yentes et al., 2013). However, we show here that signature metrics The best association for distinguishing NHL groups was evaluated
from SampEn are more robust for organizing multiple quantifications by adding different types of noise, as described in Section 3.4.1. Each
and, consequently, support the pattern recognition of complex textures, test indicated the discriminative capacity of the proposed model under
such as histological samples of NHL. adverse conditions: feature sets and images with noise levels of 20%,
Finally, Table 7 displays a summary with the performances of the 40% and, 80%. Thus, the 𝐴𝑐𝑐 values are presented in Figs. 10 and 11.
different strategies based on SampEn to classify the three NHL groups. Taking into account the results shown in Fig. 10, it is possible to
Notably, signature metrics from SampEn was able to provide the most verify that the worst cases occurred in tests with noise levels of 80%.
relevant result for a classification involving CLL × FL × MCL, using Therefore, the method fails in extreme conditions: 𝐴𝑐𝑐 rates were only
original H&E images and a reduced number of features. In addition, 50%. Also, the test considering 𝐶𝑡𝑟𝑎𝑖𝑛 × 𝐷𝑡𝑒𝑠𝑡 with noise level of 40%
this association does not require additional steps for preprocessing the was not able to provide relevant results. In this case, the 𝐴𝑐𝑐 rate was
images, such as segmentation and color normalization techniques. It
66.15%. In these experiments, the signature metrics presented more
is also observed that bGWO with KNN was able to provide the best
difficulties in distinguishing the lymphoma groups. On the other hand,
solutions (Emary et al., 2016; Mirjalili et al., 2014), including the
the 𝐶𝑡𝑟𝑎𝑖𝑛 × 𝐷𝑡𝑒𝑠𝑡 test with a noise level of 20% provided the highest
most relevant signature metrics to distinguish the NHL groups. The
accuracy (92.79%). In this evaluation, the signature metrics were able
second best association was defined from the segmented images by
to provide an acceptable distinction. Another important fact is that the
a specialist and using 36 features among the 72 that constitute the
best association was not totally affected in tests considering 𝐷𝑡𝑟𝑎𝑖𝑛 ×𝐶𝑡𝑒𝑠𝑡
SampEn signatures. The association that indicated the lowest rate was
and 𝐷𝑡𝑟𝑎𝑖𝑛 × 𝐷𝑡𝑒𝑠𝑡 , even with noise levels of 40%. The obtained results
through the classification provided by each SampEn value, exploring
were 𝐴𝑐𝑐 rates from 81.26% to 89.84%, allowing to show that the best
segmented images by the specialist.
association is promising for distinguishing the NHL groups, even with
5. Discussion some adverse conditions.
Fig. 11 presents the 𝐴𝑐𝑐 rates obtained by adding RPC and WGN to
In this section, some additional highlights regarding the discrimi- the lymphoma images classified by the best association, also in different
native ability involving the main association (signature metrics from levels. With noise levels of 20% and 40%, the corrupted images reached
SampEn with original histological images and bGWO approach) are 𝐴𝑐𝑐 values between 60.43% and 64.70%. With the 80% noise level,
provided. The information was organized into three sections: analy- the results are even lower, with a maximum rate of 52.40%. It is
sis of the best association under the influence of noise; comparison notable that the different types of noise do not have great differences
with previous works: an overview; comparison with other texture in the obtained results, with the maximum difference of 2.94% in the
descriptors. 40% noise level. Therefore, the image noise types analyzed are not a

15
G.B. Rozendo et al. Expert Systems With Applications 202 (2022) 117238

Table 4
Highest 𝐴𝑐𝑐 rates achieved through the classification provided by each SampEn value.
Database CLL × FL CLL × MCL FL × MCL CLL × FL × MCL
79.29% 66.80% 71.09% 59.91%
Original
K* GA, PSO and bGWO RF RF

95.94% 98.19% 77.01% 72.25%


Segmented by specialist
K*, RF, GA, PSO and bGWO RF, GA, PSO and bGWO SVM and K* K*, GA, PSO and bGWO

89.18% 91.89% 73.91% 68.20%


Segmented by De Oliveira et al. (2013)
K* RF, GA, PSO and bGWO K* K*

85.13% 89.09% 75.00% 70.93%


Segmented by Vahadane and Sethi (2013)
K* All K* K*

87.83% 90.09% 72.04% 68.78%


Segmented by Wienert et al. (2012)
K* RF, GA, PSO and bGWO K* K*

91.89% 89.18% 67.70% 63.00%


Segmented by Tosta, Faria, et al. (2017)
RF, GA, PSO and bGWO SVM, K* and RF K* K*

83.78% 92.60% 69.03% 61.18%


Segmented by Tokime et al. (2018)
SVM and K* RF, GA, PSO and bGWO K* K*

83.78% 89.18% 66.45% 58.38%


Segmented by Tang et al. (2018)
SVM and K* SVM and K* GA, PSO and bGWO K*

85.14% 89.18% 65.22% 64.74%


Segmented by Silva et al. (2019)
RF, GA, PSO and bGWO SVM and K* RF, GA, PSO and bGWO RF

78.54% 60.85% 72.03% 52.94%


Color normalization (Reinhard et al., 2001)
K* RF K* K*

75.79% 67.23% 73.18% 56.14%


Color normalization (Macenko et al., 2009)
K* K* K* K*

72.54% 62.83% 84.14% 50.00%


Color normalization (Khan et al., 2014)
K* K* All K*

76.05% 64.15% 71.82% 54.70%


Color normalization (Vahadane et al., 2016)
K* K* SVM K*

significant factor since the results are all negatively impacted. Through
this analysis, it is possible to observe that the best association is more
robust to noises added to the feature sets than to the processed images.
Fig. 11 indicates that both types of noise have more influence on
the classification than the test illustrated by Fig. 10. Considering the
three evaluations of Fig. 10, the image noise adding (Fig. 11) is only
comparable with the 80% level, with similar values in all tests, but with
lower 𝐴𝑐𝑐 values in all remaining analyses.

5.2. Comparison with previous works: An overview

Different approaches have been presented to classify NHL groups,


such as the described by Bai et al. (2019), Brancati et al. (2019),
Candelero et al. (2020), Codella et al. (2016), do Nascimento et al.
(2018), Janowczyk and Madabhushi (2016), Jiang et al. (2018), Mar-
tins et al. (2019), Meng et al. (2010), Orlov et al. (2010), Ribeiro et al.
(2018), Roberto et al. (2021, 2017), Song et al. (2012), Taino et al.
(2020), Tambe et al. (2019) and, Zhang et al. (2021). These authors
also explored the same NHL dataset, especially due to the lack of other
public sets with the technical conditions defined by National Cancer
Institute (2022) and National Institute on Aging (2022). Even with tests
based on a single dataset, each reported information has contributed
to the literature focused on related issues. On the other hand, these Fig. 10. 𝐴𝑐𝑐 rates after applying different noise levels to the feature sets.

models have not explored an approach based on SampEn signatures.


Thus, an overview of our method in comparison to the related works
is useful to illustrate the relevance of both the proposed strategy and available in the literature, even when we consider models based on
the results achieved. Accuracy rates reported in previously published CNN (Bai et al., 2019; Brancati et al., 2019; Janowczyk & Madabhushi,
studies and those achieved in this paper are shown on Table 8, which 2016; Roberto et al., 2021; Tambe et al., 2019; Zhang et al., 2021).
includes maximum values of 𝐴𝑐𝑐. Also, by taking into account that these models use different types
Regarding the studies considering NHL images (Table 8), it is pos- and totals of features to evaluate distinct properties in NHL images,
sible to note that our approach may indicate results among those we consider the results to be rather complementary than comparable.

16
G.B. Rozendo et al. Expert Systems With Applications 202 (2022) 117238

Table 5
𝐴𝑐𝑐 rates obtained through the signature metrics from SampEn, considering the different types of images and NHL groups.
Database CLL × FL CLL × MCL FL × MCL CLL × FL × MCL
99.60% 98.72% 99.32% 98.84%
Original
bGWO bGWO bGWO bGWO

98.64% 97.29% 90.06% 88.43%


Segmented by specialist
GA GA, PSO and bGWO GA, PSO and bGWO bGWO

93.24% 90.99% 86.33% 79.76%


Segmented by Wienert et al. (2012)
PSO bGWO GA RF

91.89% 95.94% 90.06% 81.50%


Segmented by De Oliveira et al. (2013)
GA bGWO GA, PSO and bGWO GA and bGWO

91.89% 89.09% 76.25% 70.93%


Segmented by Vahadane and Sethi (2013)
GA, PSO and bGWO SVM SVM SVM

93.24% 89.18% 89.18% 74.56%


Segmented by Tosta, Faria, et al. (2017)
GA SVM bGWO SVM

93.24% 88.89% 84.18% 72.35%


Segmented by Tokime et al. (2018)
GA and bGWO SVM and RF GA and bGWO PSO and bGWO

87.84% 89.19% 87.58% 73.99%


Segmented by Tang et al. (2018)
GA, PSO and bGWO SVM RF RF

87.84% 89.18% 85.09% 77.45%


Segmented by Silva et al. (2019)
PSO and bGWO SVM RF RF

82.53% 66.80% 79.69% 63.10%


Color normalization (Reinhard et al., 2001)
GA, PSO and bGWO bGWO PSO and bGWO GA and bGWO

78.57% 78.57% 74.71% 58.28%


Color normalization (Macenko et al., 2009)
bGWO GA and PSO PSO RF

73.77% 66.81% 86.58% 54.39%


Color normalization (Khan et al., 2014)
bGWO bGWO GA and PSO bGWO

76.47% 73.11% 75.79% 54.70%


Color normalization (Vahadane et al., 2016)
bGWO GA RF PSO and bGWO

Table 6
Summary of the best association for distinguishing lymphoma classes:
CLL × FL × MCL.
𝑚 Metric Classification approach 𝐴𝑐𝑐
𝐴
𝑚=1
𝑅
𝐴
𝑚=2 𝑆
𝑃𝑚𝑎𝑥 bGWO+KNN 98.84%
𝑅
𝑚=3 𝑆
𝑃𝑚𝑎𝑥
𝐴
𝑚=4
𝑅

In addition, an important advantage of our model is that it uses a


reduced number of features to distinguish CLL × FL × MCL since the
main features have been identified. A reduced number of features can
lead to a more comprehensive solution for specialists and contribute
significantly to the community interested in the study of NHL images.
Moreover, we believe that our results obtained from the classification Fig. 11. 𝐴𝑐𝑐 rates after adding the random pixel corruption (RPC) and white Gaussian
of segmented and normalized images could encourage the development noise (WGN).
of new techniques for color normalization and segmentation of NHL
images.
2011), lacunarity (Căliman & Ivanovici, 2012; Ivanovici et al., 2009),
5.3. Comparison with other texture descriptors
and percolation (Roberto et al., 2021, 2017). These techniques use

The best association of the proposed method (signature metrics, multidimensional and multiscale approaches that are capable of pro-
original histological images, and bGWO method) was also compared viding more complete quantifications of the images, using the RGB
with other texture descriptors: fractal dimension (Ivanovici & Richard, color model as a n-dimensional space or a cubic organization. These

17
G.B. Rozendo et al. Expert Systems With Applications 202 (2022) 117238

Table 7
Summary of the different strategies used to classify the CLL × FL × MCL groups.
Composition 𝐴𝑐𝑐 Image type Number of features Classification approach
Signature metrics from SampEn 98.84% Original 10 bGWO
SampEn signatures 87.86% Segmented by specialist 36 bGWO
Classification from each SampEn value 72.25% Segmented by specialist 1 K*, PSO, bGWO and GA

Table 8
𝐴𝑐𝑐 rates of the proposed method and related works, with information about the features, types of cross-validation and classification methods.
Authors Features Number of Classifier Validation 𝐴𝑐𝑐
features
do Nascimento et al. (2018) Color channels, morphological ≈ 145 Polynomial cross-validation 10-folds 100%
features
Bai et al. (2019) CNN, texture, statistics and color ≈ 1, 188 CNN e RF hold-out 99.10%
Orlov et al. (2010) Fourier and wavelet 12–200 WND-CHARM cross-validation 8-folds 99%
Proposed method Signatures metrics from SampEn 10 bGWO cross-validation 3-folds 98.72%–99.60%
Zhang et al. (2021) Deep-learned features 2048 GA and ResNet-50 hold-out 98.63%
Jiang et al. (2018) Morphology, entropy, and other 680 Hierarchical SVM cross-validation 10-folds 97.96%
descriptors
Brancati et al. (2019) Deep-learned features ≈ 2048 CNN cross-validation 5-folds 97.67%
Tambe et al. (2019) Deep-learned features ≈ 1000 CNN hold-out 97.33%
Martins et al. (2019) Fractal dimension and lacunarity 18 Hermite polynomial cross-validation 10-folds 97.00%
with RGB and LAB color models
Song et al. (2016) Visual features 10,384 SVM cross-validation 5-folds 96.80%
Janowczyk and Madabhushi Deep-learned features 4 CNN cross-validation 5-folds 96.58%
(2016)
Roberto et al. (2017) Percolation theory 15 DECORATE cross-validation 10-folds 96.40%
Roberto et al. (2021) ResNet-50 and fractal features 2048 Double-CNN ensemble cross-validation 10-folds 95.55%
(DL+HC)
Codella et al. (2016) Color settings enhancement 200 SVM cross-validation 3-folds 95.50%
Taino et al. (2020) Fractal features, Curvelet and 73 GA with KNN cross-validation 10-folds 94.70%
Haralick
Meng et al. (2010) Visual features, wavelet, Gabor e 50 C-RSPM+WMVA cross-validation 3-folds 92.70%
LBP
Ribeiro et al. (2018) Fractal features, Curvelet 43 Decision tree cross-validation 10-folds 86.14%
sub-images and Haralick
Candelero et al. (2020) LeNet-5, Haralick, fractal features 18 K* cross-validation 3-folds 82.01%
and GA

Table 9
Classification of the CLL × FL × MCL groups using different types of texture descriptors.
Features Database Classification approach 𝐴𝑐𝑐
Signature metrics from SampEn (Proposed method) Original bGWO with KNN 98.84%
Percolation (Roberto et al., 2021, 2017) Original GA e PSO with KNN 85.54%
Lacunarity (Căliman & Ivanovici, 2012; Ivanovici et al., 2009) Segmented by specialist RF 70.52%
Fractal dimension (Ivanovici & Richard, 2011) Original K* 61.27%

descriptors were applied to quantify the histological NHL samples (orig- the escape probability parameter needs to be in the interval [0, 1].
inal, segmented, and normalized). The classifications were performed However, there is a lack of published studies regarding the ideal values
with the classifiers presented in Section 3.3 and the obtained results for this parameter (Song, Fong, Deb, & Hanne, 2018). On the other
(𝐴𝑐𝑐 values) are shown on Table 9. It is possible to highlight that hand, the experiments and indications presented here were able to pro-
the descriptors provided the best results from the original histologi- vide a relevant overview of our proposal compared to the consolidated
cal images and also from those segmented by the specialists. These approaches commonly explored in the literature for classifying NHL
combinations were also identified via SampEn signatures. Among the images. Moreover, considering that our strategy was able to indicate the
compared descriptors, percolation (Roberto et al., 2021, 2017) was main associations, with a reduced number of features, the information
responsible for providing the highest 𝐴𝑐𝑐 rate (85.54%) via original presented can contribute to the development of computational models
images. However, this result is lower than the value achieved via addressed to aid specialists interested in the composition of machine
our best association (signature metrics), which further enhances the learning techniques, both for NHL samples and other contexts of H&E
relevance of the proposed method, even from grayscale images. images.
We believe that more analysis could be carried out to define
whether these results are maintained in new experiments, with different 6. Conclusion
parameters and other methods to support each indication, or even to
make adjustments and define the limits of each parameter or technique. In this paper, a new interpretation of SampEn values was accom-
For instance, an integrated approach is useful to support the assessment plished via the exploration of parameter intervals of 𝑚 and 𝑟. These
and ranking of concepts (Samanlioglu & Ayağ, 2021), here the main values were considered to be SampEn signatures. To demonstrate the
indications (Tables 5, 7 and 9), providing a proof of concept regarding relevance of using SampEn signatures, three tests were performed: a
the best combination. Also, when some methods are considered (GA, classification based on SampEn texture signatures; classification pro-
PSO, and bGWO), it is possible to perform experiments involving vided by each SampEn value; and, classification using signature metrics
population size, parameter optimization and convergence analysis in from SampEn, an approach that had not been explored in the literature.
order to verify the impacts of these in the process of definition of the This last strategy improved the quantitative capacity of SampEn, with
best solution. In addition, taking into account the bGWO algorithm, more interpretations and associations for this descriptor. For each

18
G.B. Rozendo et al. Expert Systems With Applications 202 (2022) 117238

test, analyses with multiple segmentation and color normalization ap- Declaration of competing interest
proaches were investigated in detail. The best association was defined
by signature metrics from SampEn, original histological images and The authors declare that they have no known competing finan-
bGWO method. This solution brings important alternatives for the cial interests or personal relationships that could have appeared to
improvement of CAD systems since it provides accurate results and influence the work reported in this paper.
does not require preprocessing steps. Only ten features were used to
successfully distinguish CLL × FL × MCL, representing a combination Acknowledgments
with a total of descriptors considerably lower than most state-of-the-
art proposals. Therefore, signature metrics from SampEn are relevant This study was financed in part by the : Coordenação de Aperfeiçoa-
attributes for pattern recognition of NHL textures. The main features mento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code
were 𝐴 and 𝑅 present in most of the observations 𝑚, followed by the 001; National Council for Scientific and Technological Development
𝑆 and 𝑃𝑚𝑎𝑥 metrics, obtained only from 𝑚 equal to 2 and 3. CNPq (Grants #313643/2021-0, #430965/2018-4 and #311404/2021-
Another important contribution is that, even in tests considering fea- 9); and State of Minas Gerais Research Foundation - FAPEMIG (Grant
ture sets with distinct levels of noise, the best association was capable #APQ-00578-18).
of classifying the NHL cancer type. This fact was observed after adding
noise levels of 20% and 40%. The results obtained by our proposed References
method were only hindered at noise levels of 80%. Also, it was possible
Albayrak, A., Akhan, A. U., Calik, N., Capar, A., Bilgin, G., Toreyin, B. U., et al. (2021).
to identify that RPC and WGN have more influence on the model than A whole-slide image grading benchmark and tissue classification for cervical cancer
noise added to the feature sets. On the other hand, the performance precursor lesions with inter-observer variability. Medical & Biological Engineering &
with signature metrics provided important distinctions when compared Computing, 59(7), 1545–1561.
to other techniques available in the literature. Another limitation lies in Alcaraz, R., & Rieta, J. J. (2010). A review on sample entropy applications for the
non-invasive analysis of atrial fibrillation electrocardiograms. Biomedical Signal
applying the method in segmented or color normalized images, wherein
Processing and Control, 5(1), 1–14.
SampEn signature metrics could not perform well, especially for the Ali, T., Masood, K., Irfan, M., Draz, U., Nagra, A. A., Asif, M., et al. (2020). Multistage
three-classes comparison, despite reaching accuracy values above 90% segmentation of prostate cancer tissues using sample entropy texture analysis.
on some two-classes comparison scenarios. This indicates that further Entropy, 22(12), 1370.
American Cancer Society (2022a). Cancer facts & statistics. https://
analysis of the features under pre-processed and segmented images is
cancerstatisticscenter.cancer.org. (Accessed 21 January 2022).
required to understand this behavior. Thus, even though the proposal American Cancer Society (2022b). Lymphoma. https://www.cancer.org/cancer/
failed to overcome the results of some consolidated techniques, our lymphoma.html. (Accessed 21 January 2022).
study was able to indicate details of the most relevant solution with Azami, H., Escudero, J., & Humeau-Heurtier, A. (2017). Bidimensional distribution
a full understanding of the features (advantages and limits). This is an- entropy to analyze the irregularity of small-sized textures. IEEE Signal Processing
Letters, 24(9), 1338–1342.
other important contribution to the community interested in the issues Azami, H., da Silva, L. E. V., Omoto, A. C. M., & Humeau-Heurtier, A. (2019). Two-
addressed here. Therefore, we believe that the contributions provided dimensional dispersion entropy: An information-theoretic method for irregularity
by this method, such as a new association of SampEn signatures and the analysis of images. Signal Processing: Image Communication, 75, 178–187.
bGWO approach and the identification of the best features, are relevant Backes, A. R., & Bruno, O. M. (2013). Texture analysis using volume-radius fractal
dimension. Applied Mathematics and Computation, 219(11), 5870–5875.
for the area of NHL classification. We also believe our method can be Backes, A. R., Casanova, D., & Bruno, O. M. (2012). Color texture analysis based on
used either as a complementary or a benchmark for future research. fractal descriptors. Pattern Recognition, 45(5), 1984–1992.
In the future, we believe that investigations of different intervals Badrinarayanan, V., Kendall, A., & Cipolla, R. (2017). Segnet: A deep convolutional
for parameters 𝑚 and 𝑟 may enhance our understanding of the limits of encoder-decoder architecture for image segmentation. IEEE Transactions on Pattern
Analysis and Machine Intelligence, 39(12), 2481–2495.
the signatures in the process of classifying and recognizing the patterns
Bai, J., Jiang, H., Li, S., & Ma, X. (2019). NHL pathological image classification based
of NHL. It is important to thoroughly explore SampEn calculation on hierarchical local information and GoogLeNet-based representations. BioMed
using sub-images by assessing window numbers and how different Research International, 2019.
types of distances may affect the process of verifying similarity among Battula, P., & Sharma, S. (2018). Automatic classification of Non Hodgkin‘s lymphoma
using histological images: Recent advances and directions. In 2018 International
windows. In addition, considering the methods explored here, such as
conference on advances in computing, communication control and networking (pp.
GA, PSO and bGWO, experiments involving population size, parameter 634–639). IEEE.
optimization, combinations with other classifiers (in addition to KNN) Brancati, N., De Pietro, G., Frucci, M., & Riccio, D. (2019). A deep learning approach
and convergence analysis can be performed to identify the impacts for breast invasive ductal carcinoma detection and lymphoma multi-classification
in histological images. IEEE Access, 7, 44709–44720.
on the best associations of features and techniques for the study of
Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32.
NHL groups. Finally, SampEn signatures and signature metrics can be Căliman, A., & Ivanovici, M. (2012). Psoriasis image analysis using color lacunarity. In
applied to other types of images in order to analyze their discriminatory Optimization of electrical and electronic equipment (OPTIM), 2012 13th international
abilities under more conditions. conference on (pp. 1401–1406). IEEE.
Candelero, D., Roberto, G. F., do Nascimento, M. Z., Rozendo, G. B., & Neves, L.
A. (2020). Selection of CNN, haralick and fractal features based on evolutionary
Statements of ethical approval algorithms for classification of histological images. In 2020 IEEE international
conference on bioinformatics and biomedicine (pp. 2709–2716). IEEE.
Ethics approval was not required for this study. Cernadas, E., Fernández-Delgado, M., González-Rufino, E., & Carrión, P. (2017).
Influence of normalization and color space to color texture classification. Pattern
Recognition, 61, 120–138.
CRediT authorship contribution statement Cleary, J. G., & Trigg, L. E. (1995). K*: An instance-based learner using an entropic
distance measure. In 12th International conference on machine learning (pp. 108–114).
Guilherme Botazzo Rozendo: Conceptualization, Methodology, Codella, N., Moradi, M., Matasar, M., Sveda-Mahmood, T., & Smith, J. R. (2016). Lym-
phoma diagnosis in histopathology using a multi-stage visual learning approach. In
Software, Validation, Formal analysis, Investigation, Writing – original Medical imaging 2016: Digital pathology, vol. 9791 (p. 97910H). International Society
draft. Marcelo Zanchetta do Nascimento: Writing – review & editing. for Optics and Photonics.
Guilherme Freire Roberto: Writing – review & editing. Paulo Rogério Costa, M., Goldberger, A. L., & Peng, C.-K. (2002). Multiscale entropy analysis of
de Faria: Formal analysis, Investigation, Data curation. Adriano Bar- complex physiologic time series. Physical Review Letters, 89(6), Article 068102.
da Silva, L. E. V., da Silva Senra Filho, A. C., Fazan, V. P. S., Felipe, J. C., & Murta, L.
bosa Silva: Software. Thaína Aparecida Azevedo Tosta: Software, O. (2014). Two-dimensional sample entropy analysis of rat sural nerve aging. In
Writing – review & editing. Leandro Alves Neves: Conceptualization, Engineering in medicine and biology society (EMBC), 2014 36th annual international
Validation, Writing – review & editing, Supervision. conference of the IEEE (pp. 3345–3348). IEEE.

19
G.B. Rozendo et al. Expert Systems With Applications 202 (2022) 117238

da Silva Senra Filho, A. C., Duque, J. J., Silva, L. E. V., Felipe, J. C., Fazan, V. P. S., & Maleki, N., Zeinali, Y., & Niaki, S. T. A. (2021). A k-NN method for lung cancer
Junior, L. O. M. (2019). Automatic diabetes detection from histological images of prognosis with the use of a genetic algorithm for feature selection. Expert Systems
rats phrenic nerve using two-dimensional sample entropy. Journal of Medical and with Applications, 164, Article 113981.
Biological Engineering, 39(1), 70–75. Martins, R. G., Martins, A. S., Neves, L. A., Lima, L. V., Flores, E. L., & do Nasci-
De Oliveira, D. L. L., Do Nascimento, M. Z., Neves, L. A., De Godoy, M. F., De Arruda, P. mento, M. Z. (2017). Exploring polynomial classifier to predict match results in
F. F., & de Santi Neto, D. (2013). Unsupervised segmentation method for cuboidal football championships. Expert Systems with Applications, 83, 79–93.
cell nuclei in histological prostate images based on minimum cross entropy. Expert Martins, A. S., Neves, L. A., Faria, P. R., Tosta, T. A., Bruno, D. O., Longo, L. C., et
Systems with Applications, 40(18), 7331–7340. al. (2019). Colour feature extraction and polynomial algorithm for classification of
do Nascimento, M. Z., Martins, A. S., Tosta, T. A. A., & Neves, L. A. (2018). lymphoma images. In Iberoamerican congress on pattern recognition (pp. 262–271).
Lymphoma images analysis using morphological and non-morphological descriptors Springer.
for classification. Computer Methods and Programs in Biomedicine, 163, 65–77. Mathworks (2022a). Skewness. https://www.mathworks.com/help/stats/skewness.html.
Dong, P. (2008). Fractal signatures for multiscale processing of hyperspectral image (Accessed 21 January 2022).
data. Advances in Space Research, 41(11), 1733–1743. Mathworks (2022b). Trapezoidal numerical integration. https://www.mathworks.com/
dos Santos, L. F. S., Neves, L. A., Rozendo, G. B., Ribeiro, M. G., do Nascimento, M. Z., help/matlab/ref/trapz.html. (Accessed 21 January 2022).
& Tosta, T. A. A. (2018). Multidimensional and fuzzy sample entropy (SampEnMF) Meng, T., Lin, L., Shyu, M.-L., & Chen, S.-C. (2010). Histology image classification
for quantifying H&E histological images of colorectal cancer. Computers in Biology using supervised classification and multimodal fusion. In Multimedia (ISM), 2010
and Medicine, 103, 148–160. IEEE international symposium on (pp. 145–152). IEEE.
Emary, E., Zawbaa, H. M., & Hassanien, A. E. (2016). Binary grey wolf optimization Mirjalili, S., Mirjalili, S. M., & Lewis, A. (2014). Grey wolf optimizer. Advances in
approaches for feature selection. Neurocomputing, 172, 371–381. Engineering Software, 69, 46–61.
Fabbri, R., Goncalves, W. N., Lopes, F. J., & Bruno, O. M. (2012). Multi-q pattern Morel, C., & Humeau-Heurtier, A. (2021). Multiscale permutation entropy for
analysis: A case study in image classification. Physica A: Statistical Mechanics and two-dimensional patterns. Pattern Recognition Letters.
its Applications, 391(19), 4487–4496. Nascimento, M. Z. d., Neves, L., Duarte, S. C., Duarte, Y. A. S., & Batista, V. R. (2015).
Feng, Y., Zhang, L., & Yi, Z. (2018). Breast cancer cell nuclei classification in Classification of histological images based on the stationary wavelet transform.
histopathology images using deep neural networks. International Journal of Computer Journal of Physics: Conference Series, 574(1), Article 012133.
Assisted Radiology and Surgery, 13(2), 179–191. National Cancer Institute (2022). Comprehensive cancer information. https://www.
Garner, S. R., et al. (1995). Weka: The waikato environment for knowledge analysis. cancer.gov/. (Accessed 21 January 2022).
In Proceedings of the New Zealand computer science research students conference (pp. National Institute on Aging 2022. https://www.nia.nih.gov/. (Accessed 21 January
57–64). Citeseer. 2022).
Gaudêncio, A. S. F., Vaz, P. G., Hilal, M., Cardoso, J. M., Mahé, G., Lederlin, M., et Oger, M., Belhomme, P., & Gurcan, M. N. (2012). A general framework for the
al. (2020). Three-dimensional multiscale fuzzy entropy: validation and application segmentation of follicular lymphoma virtual slides. Computerized Medical Imaging
to idiopathic pulmonary fibrosis. IEEE Journal of Biomedical and Health Informatics, and Graphics, 36(6), 442–451.
25(1), 100–107. Orlov, N. V., Chen, W. W., Eckley, D. M., Macura, T. J., Shamir, L., Jaffe, E. S.,
Hallek, M. (2017). Chronic lymphocytic leukemia: 2017 update on diagnosis, risk et al. (2010). Automatic classification of lymphoma images with transform-based
stratification, and treatment. American Journal of Hematology, 92(9), 946–965. global features. IEEE Transactions on Information Technology in Biomedicine, 14(4),
1003–1013.
Hans, R., & Kaur, H. (2022). Improved local search based grey wolf optimization
Pei, M., Goodman, E. D., Punch, W. F., & Ding, Y. (1995). Genetic algorithms for
for feature selection. In Proceedings of the international conference on industrial and
classification and feature extraction. In Classification society conference (pp. 1–28).
manufacturing systems (pp. 371–387). Springer.
Peleg, S., Naor, J., Hartley, R., & Avnir, D. (1984). Multiple resolution texture analysis
Haralick, R. M., Shanmugam, K., et al. (1973). Textural features for image classification.
and classification. IEEE Transactions on Pattern Analysis and Machine Intelligence, (4),
IEEE Transactions on Systems, Man, and Cybernetics, (6), 610–621.
518–523.
Humeau-Heurtier, A. (2019). Texture feature extraction methods: A survey. IEEE Access,
Platt, J. C. (1999). 12 Fast training of support vector machines using sequential minimal
7, 8975–9000.
optimization. Advances in Kernel Methods, 185–208.
Humeau-Heurtier, A. (2021). Entropy analysis in health informatics. In Signal processing
Ragothaman, S., Narasimhan, S., Basavaraj, M. G., & Dewar, R. (2016). Unsupervised
techniques for computational health informatics (pp. 123–143). Springer.
segmentation of cervical cell images using gaussian mixture model. In Proceedings of
Instituto Nacional de Câncer (2022). Cancer incidence in Brazil. https://www.inca.gov.
the IEEE conference on computer vision and pattern recognition workshops (pp. 70–75).
br/estimativa. (Accessed 21 January 2022).
Raju, A. R., Pabboju, S., & Rao, R. R. (2020). Brain image classification using dual-tree
Irshad, H., Veillard, A., Roux, L., & Racoceanu, D. (2013). Methods for nuclei detection,
M-band wavelet transform and naïve Bayes classifier. In Intelligent computing in
segmentation, and classification in digital histopathology: a review—current status
engineering (pp. 635–642). Springer.
and future potential. IEEE Reviews in Biomedical Engineering, 7, 97–114.
Ramachandran, S., Strisciuglio, N., Vinekar, A., John, R., & Azzopardi, G. (2020). U-
Ivanovici, M., & Richard, N. (2011). Fractal dimension of color fractal images. IEEE
COSFIRE filters for vessel tortuosity quantification with application to automated
Transactions on Image Processing, 20(1), 227–235.
diagnosis of retinopathy of prematurity. Neural Computing and Applications, 32(16),
Ivanovici, M., Richard, N., & Decean, H. (2009). Fractal dimension and lacunarity of 12453–12468.
psoriatic lesions-A colour approach. Medicine, 6(4), 7. Reinhard, E., Adhikhmin, M., Gooch, B., & Shirley, P. (2001). Color transfer between
Janowczyk, A., & Madabhushi, A. (2016). Deep learning for digital pathology image images. IEEE Computer graphics and applications, 21(5), 34–41.
analysis: A comprehensive tutorial with selected use cases. Journal of Pathology Ribeiro, M. G., Neves, L. A., Roberto, G. F., Tosta, T. A., Martins, A. S., & Do Nasci-
Informatics, 7. mento, M. Z. (2018). Analysis of the influence of color normalization in the
Jiang, H., Li, Z., Li, S., & Zhou, F. (2018). An effective multi-classification method for classification of non-hodgkin lymphoma images. In 2018 31st SIBGRAPI conference
NHL pathological images. In 2018 IEEE international conference on systems, man, and on graphics, patterns and images (pp. 369–376). IEEE.
cybernetics (pp. 763–768). IEEE. Richman, J. S., & Moorman, J. R. (2000). Physiological time-series analysis using
Jothi, J. A. A., & Rajam, V. M. A. (2017). A survey on automated cancer diagnosis approximate entropy and sample entropy. American Journal of Physiology-Heart and
from histopathology images. Artificial Intelligence Review, 48(1), 31–81. Circulatory Physiology, 278(6), H2039–H2049.
Junior, J. J. d. M. S., Backes, A. R., & Bruno, O. M. (2018). Randomized neural network Roberto, G. F., Lumini, A., Neves, L. A., & do Nascimento, M. Z. (2021). Fractal neural
based signature for color texture classification. Multidimensional Systems and Signal network: A new ensemble of fractal geometry and convolutional neural networks for
Processing, 1–16. the classification of histology images. Expert Systems with Applications, 166, Article
Kennedy, J. (2011). Particle swarm optimization. In Encyclopedia of machine learning 114103.
(pp. 760–766). Springer. Roberto, G. F., Neves, L. A., Nascimento, M. Z., Tosta, T. A., Longo, L. C., Martins, A.
Khan, A. M., Rajpoot, N., Treanor, D., & Magee, D. (2014). A nonlinear mapping S., et al. (2017). Features based on the percolation theory for quantification of
approach to stain normalization in digital histopathology images using image- non-hodgkin lymphomas. Computers in Biology and Medicine.
specific color deconvolution. IEEE Transactions on Biomedical Engineering, 61(6), Ronneberger, O., Fischer, P., & Brox, T. (2015). U-net: Convolutional networks
1729–1738. for biomedical image segmentation. In International conference on medical image
Li, X., Chen, H., Li, C., Rahaman, M. M., Li, X., Wu, J., et al. (2022). What can machine computing and computer-assisted intervention (pp. 234–241). Springer.
vision do for lymphatic histopathology image analysis: A comprehensive review. Samanlioglu, F., & Ayağ, Z. (2021). Concept selection with hesitant fuzzy
arXiv preprint arXiv:2201.08550. ANP-PROMETHEE II. Journal of Industrial and Production Engineering, 38(7),
Li, A.-D., Xue, B., & Zhang, M. (2021). Improved binary particle swarm optimization 547–560.
for feature selection with new initialization and search space reduction strategies. Shamir, L., Orlov, N., Eckley, D. M., Macura, T. J., & Goldberg, I. G. (2008). IICBU
Applied Soft Computing, 106, Article 107302. 2008: a proposed benchmark suite for biological image analysis. Medical & Biological
Macenko, M., Niethammer, M., Marron, J. S., Borland, D., Woosley, J. T., Guan, X., et Engineering & Computing, 46(9), 943–947.
al. (2009). A method for normalizing histology slides for quantitative analysis. In Silva, L. E., Duque, J. J., Felipe, J. C., Murta, L. O., Jr., & Humeau-Heurtier, A.
2009 IEEE international symposium on biomedical imaging: From nano to macro (pp. (2018). Two-dimensional multiscale entropy analysis: Applications to image texture
1107–1110). IEEE. evaluation. Signal Processing, 147, 224–232.

20
G.B. Rozendo et al. Expert Systems With Applications 202 (2022) 117238

Silva, A. B., Martins, A. S., Neves, L. A., Faria, P. R., Tosta, T. A., & do Nascimento, M. Tosta, T. A. A., de Faria, P. R., Servato, J. P. S., Neves, L. A., Roberto, G. F., Martins, A.
Z. (2019). Automated nuclei segmentation in dysplastic histopathological oral S., et al. (2019). Unsupervised method for normalization of hematoxylin-eosin
tissues using deep neural networks. In Iberoamerican congress on pattern recognition stain in histological images. Computerized Medical Imaging and Graphics, 77, Article
(pp. 365–374). Springer. 101646.
Silva, L., Senra Filho, A., Fazan, V., Felipe, J., & Junior, L. M. (2016). Two-dimensional Tosta, T. A. A., Neves, L. A., & do Nascimento, M. Z. (2017). Segmentation methods
sample entropy: assessing image texture through irregularity. Biomedical Physics & of h&e-stained histological images of lymphoma: a review. Informatics in Medicine
Engineering Express, 2(4), Article 045002. Unlocked, 9, 35–43.
Song, Y., Cai, W., Huang, H., Feng, D., Wang, Y., & Chen, M. (2016). Bioimage Vahadane, A., Peng, T., Sethi, A., Albarqouni, S., Wang, L., Baust, M., et al. (2016).
classification with subcategory discriminant transform of high dimensional visual Structure-preserving color normalization and sparse stain separation for histological
descriptors. BMC Bioinformatics, 17(1), 465. images. IEEE Transactions on Medical Imaging, 35(8), 1962–1971.
Song, Y., Crowcroft, J., & Zhang, J. (2012). Automatic epileptic seizure detection in Vahadane, A., & Sethi, A. (2013). Towards generalized nuclear segmentation in
EEGs based on optimized sample entropy and extreme learning machine. Journal histological images. In 13th IEEE international conference on bioinformatics and
of Neuroscience Methods, 210(2), 132–146. bioengineering (pp. 1–4). IEEE.
Song, Q., Fong, S., Deb, S., & Hanne, T. (2018). Gaussian guided self-adaptive wolf Vose, J. M. (2017). Mantle cell lymphoma: 2017 update on diagnosis, risk-stratification,
search algorithm based on information entropy theory. Entropy, 20(1), 37. and clinical management. American Journal of Hematology, 92(8), 806–813.
Swerdlow, S. H., Campo, E., Pileri, S. A., Harris, N. L., Stein, H., Siebert, R., et Wibmer, A., Hricak, H., Gondo, T., Matsumoto, K., Veeraraghavan, H., Fehr, D., et al.
al. (2016). The 2016 revision of the world health organization classification of (2015). Haralick texture analysis of prostate MRI: utility for differentiating non-
lymphoid neoplasms. Blood, 127(20), 2375–2390. cancerous prostate from prostate cancer and differentiating prostate cancers with
Taino, D. F., Ribeiro, M. G., Roberto, G. F., Zafalon, G. F., do Nascimento, M. Z., different gleason scores. European Radiology, 25(10), 2840–2850.
Tosta, T. A., et al. (2020). Analysis of cancer in histological images: employing an Wienert, S., Heim, D., Saeger, K., Stenzinger, A., Beil, M., Hufnagl, P., et al.
approach based on genetic algorithm. Pattern Analysis and Applications, 1–14. (2012). Detection and segmentation of cell nuclei in virtual microscopy images:
Tambe, R., Mahajan, S., Shah, U., Agrawal, M., & Garware, B. (2019). Towards a minimum-model approach. Scientific Reports, 2, 503.
designing an automated classification of lymphoma subtypes using deep neural Xerri, L., Dirnhofer, S., Quintanilla-Martinez, L., Sander, B., Chan, J. K., Campo, E., et
networks. In Proceedings of the ACM india joint international conference on data science al. (2016). The heterogeneity of follicular lymphomas: from early development to
and management of data (pp. 143–149). ACM. transformation. Virchows Archiv, 468(2), 127–139.
Tang, J., Li, J., & Xu, X. (2018). Segnet-based gland segmentation from colon Xiao, J., He, C., Jiang, X., & Liu, D. (2010). A dynamic classifier ensemble selection
cancer histology images. In 2018 33rd Youth academic annual conference of chinese approach for noise data. Information Sciences, 180(18), 3402–3421.
association of automation (pp. 1078–1082). IEEE. Yeh, J.-R., Lin, C.-W., & Shieh, J.-S. (2011). An approach of multiscale complexity in
Tokime, R. B., Elassady, H., & Akhloufi, M. A. (2018). Identifying the cells’ nuclei using texture analysis of lymphomas. IEEE Signal Processing Letters, 18(4), 239–242.
deep learning. In 2018 IEEE life sciences conference (pp. 61–64). IEEE. Yentes, J. M., Hunt, N., Schmid, K. K., Kaipust, J. P., McGrath, D., & Stergiou, N. (2013).
Too, J., & Abdullah, A. R. (2021). A new and fast rival genetic algorithm for feature The appropriate use of approximate entropy and sample entropy with short data
selection. The Journal of Supercomputing, 77(3), 2844–2874. sets. Annals of Biomedical Engineering, 41(2), 349–365.
Tosta, T. A. A., Faria, P. R., Neves, L. A., & do Nascimento, M. Z. (2017). Computational Zhang, X., Zhang, K., Jiang, M., & Yang, L. (2021). Research on the classification of
method for unsupervised segmentation of lymphoma histological images based on lymphoma pathological images based on deep residual neural network. Technology
fuzzy 3-partition entropy and genetic algorithm. Expert Systems with Applications, and Health Care, (Preprint), 1–10.
81, 223–243. Zhu, X., & Wu, X. (2004). Class noise vs. attribute noise: A quantitative study. Artificial
Tosta, T. A. A., de Faria, P. R., Neves, L. A., & do Nascimento, M. Z. (2019). Computa- Intelligence Review, 22(3), 177–210.
tional normalization of H&E-stained histological images: Progress, challenges and
future potential. Artificial Intelligence in Medicine, 95, 118–132.

21

You might also like