Professional Documents
Culture Documents
a Center for Biomedical Imaging & Informatics, Room R203, 675 Hoes Lane, Piscataway, NJ 08854, USA
b Department of Pathology & Laboratory Medicine, University of Medicine & Dentistry of New Jersey,
Piscataway, NJ 08854, USA
c Department of Electrical and Computer Engineering, Center for Advanced Information Processing,
Received 22 August 2004 ; received in revised form 4 March 2005; accepted 8 March 2005
1. Introduction
0169-2607/$ — see front matter © 2005 Elsevier Ireland Ltd. All rights reserved.
doi:10.1016/j.cmpb.2005.03.006
60 W. Chen et al.
it is essential to distinguish among subclasses of focused on pathology. These include the Pathex
pathologies [1]. The peripheral blood of patients framework and the Pathex/Red system [18] for
is routinely screened for abnormalities, however, assisting pathologists with laboratory data at Ohio
the subtle visible differences exhibited by some State University, ECLIPS [19] at the University of
disorders can lead to a significant number of false Illinois Urbana, as well as the PathFinder project on
negatives during routine microscopic evaluation of anatomic pathology diagnosis [20] at the University
specimens. of Southern California and Stanford. In PathFinder,
Mantle cell lymphoma (MCL) is an intermediate an expert system provides a differential diagnosis
grade lymphoma with a 3—5 year median survival based on the initial histological feature(s) observed
rate [2—4]. MCL morphology is generally described by the pathologists, and suggests to the user
as a monotonous proliferation of small to medium additional histological features for observation that
sized lymphoid cells with scant cytoplasm, are likely to narrow the differential diagnosis,
variably irregular, round or indented nuclei, thus helping to screen for observations which are
dispersed chromatin, and inconspicuous nucleoli. incompatible with a given disease or disorder.
However, the disease may exhibit a spectrum of While the mechanisms for content-based access
presentations, which can sometimes be confused to alphanumeric data have been extensively stud-
with other lymphomas that follow a less aggressive ied, and are now considered relatively well under-
clinical course. Chronic lymphocytic leukemia stood, content-based image access remains elusive
(CLL) and follicular center cell lymphoma (FCC) and is an active area of research in the computer
are two examples, which were considered in our vision and image processing communities, as well as
studies. The main objective of our research was to in disparate application domains, such as remote
develop and optimize image-based methods which sensing, diagnostic medicine, molecular biology,
would provide quick, reliable decision support in pharmacy, and computer aided design. Technolo-
detecting and characterizing these disorders. gies that capture, describe, and index the visual
Immunophenotyping with flowcytometry is essence of multimedia objects rely on the methods
considered a definitive approach for reliably and principles of image analysis, pattern recog-
differentiating among lymphoproliferative diseases nition, and database theory. This relatively new
[3—7]. However, because of the time and expense area of research spans a spectrum of applications
of implementing studies, it is not generally utilized [21—24]. A survey of content-based image retrieval
unless a case is first categorized as suspicious (CBIR) was recently presented by Antani et al. [25].
during microscopic evaluation. Throughout the Several general-purpose CBIR systems have been
course of our studies the diagnosis determined developed such as the IBM QBIC system [26], the
by immunophenotyping was considered as the Photobook system [27], the WBIIS system [28],
gold-standard for gauging the performance of the the Blobworld system [29], and the SIMPLIcity
image guided approach [8]. system [30]. These systems are not suited for
most pathology applications, however, because of
the special characteristics of digitized pathology
1.2. Technical significance such as the high resolution of the images and
chromatic profiles of the stained specimens. Over
Recent literature ascribes much of the difficulty the past few years there has been increased
in rendering consistent diagnoses to the subjective interest and effort applied to utilizing CBIR in
impressions of observers and shows that, when mor- medical applications [31—33]. Individual strategies
phologic cell classification is based upon computer- and approaches differ according to the degree
aided analysis, objectivity, and reproducibility can of generality (general purpose versus domain
be made to improve considerably [9—12]. Using specific), level of feature abstraction (primitive
these techniques it may be possible to detect and feature versus logical features), overall dissimi-
track subtle changes in measurable parameters larity measure used in retrieval ranking, database
leading to the discovery of novel diagnostic clues, indexing procedure, level of user intervention
which may not be apparent by human visual (with or without relevance feedback), and by the
inspection alone. methods used to evaluate their performance. The
Developing approaches that can reliably trans- Pittsburgh Supercomputing Center has developed
form complex diagnostic concepts into well- a system which utilizes global characteristics of
defined algorithmic procedures is an active area images to provide a measure of Gleason grade
of research [13—17]. Diagnostic pathology offers a of prostate tumors [34]. Wang from Pennsylvania
rich environment for conducting such studies. Thus, State University emphasizes the use of wavelet
several major projects in artificial intelligence have technology and integrated region matching (IRM)
Image mining for investigative pathology using optimized feature extraction and data fusion 61
distances for characterizing pathology images [35]. exploit the statistical methods that are utilized for
The system indexes block segments of images at discriminating among disorders.
different scales by partitioning the original image The PathMiner project consists of three sub-
into smaller overlapping blocks. The CBIR engine is systems: the Distributed Telemicroscopy (DT) sub-
interfaced with a server that allows users to browse system, the Intelligent Archiving (IA) subsystem,
portions of the original matched image at different and the IGDS subsystem. The first generation DT
scales. This system differs from the system that we subsystem [6] provides a distributed telepathology
are developing, both in design and purpose. The environment by enabling multiple users from dis-
system from Penn State emphasizes the general parate clinical and research sites to simultaneously
tasks of retrieval and browsing whereas the system control robotic microscopes from remote locations
that we have proposed focuses on developing a while each session participant receives a digital
set of portable, web-based tools for collaborative broadcast of the specimen. The IA subsystem facil-
studies and clinical decision support. itates the automated population and management
of databases through the use of unsupervised
1.3. The PathMiner project scanning and indexing of candidate lymphocytes. In
this paper, we report how through the use of robust
We have already reported the design and develop- texture descriptors and density estimation based
ment of an Image Guided Decision Support (IGDS) data fusion the reliability and performance of the
system, which was shown to reliably discriminate governing classifications was significantly improved
among a set of lymphoproliferative disorders that while simultaneously reducing the dimensionality
can sometimes be confused with one another during of the feature space.
routine microscopic evaluation. The IGDS system With minimal exceptions, the software modules
automatically identifies and retrieves images, of PathMiner project were implemented using the
diagnosis, and correlated clinical data of those JAVA programming language to provide maximum
cases from within a ‘‘gold-standard’’ database portability.
whose spectral and spatial profiles are most similar
to a given query image. The system suggests the
most likely diagnosis based on majority logic of
the retrieved cases. Man—machine performance 2. Methods
comparison studies showed that the image-guided
approach provided significant improvement in dis- 2.1. Classification optimization of IGDS
criminating among disorders while simultaneously
reducing the frequency of false negatives. One 2.1.1. Optimization condition
project which is closely related to our research is Cross validation methods are widely used in statisti-
the PathMaster system from Yale University, which cal pattern recognition to determine the robustness
is a content-based retrieval system that was used of an algorithm by evaluating generalization
to discriminate among specimens of mantle cell performance based on ‘‘resampling’’. The k-fold
lymphoma and small cell lymphomas [36] that are cross validation approach first randomly divides
prepared using a touch prep protocol. This system data into k subsets of approximately the same size.
implements semi-automatic segmentation utilizing The ‘‘test algorithm’’ is then trained with k − 1
commercial, off-the-shelf image segmentation subsets, and tested with the remaining subset to
(Adobe PhotoShop) and does not exploit the generate the appropriate performance measure.
potential of advanced computer vision techniques. The training and testing session is executed k
Within this system each cell can be represented by times, with each iteration utilizing different
more than 2000 features, but the question of which subsets for testing. Therefore, when smaller k’s
of the features belong to an optimal subset was not are used fewer samples are utilized for training thus
explored. creating a heightened challenge for the algorithm’s
The PathMiner project, that we have under- capacity to produce the same performance
taken, started as an effort to establish a large- measure.
scale web-based pathology image database, which Because of limited sample size, a 10-fold cross
could provide clinical decision support and medical validation was used in the original IGDS study [6].
education based on statistical pattern recognition In order to further evaluate the robustness and
and CBIR. Since each of the diseases under study stability of the IGDS system, a balanced mix of over
may exhibit a spectrum of morphologies, it was 900 test cells was recently established. Therefore,
important that the database contain a sufficiently experiments reported in the current study used the
large number of cases. This allows us to effectively much more stringent two-fold cross validation and
62 W. Chen et al.
were aimed to maximize correct classification rate, size of the overlapping neighborhoods must be
defined as sufficiently large to capture the texture character-
istics, while remaining small enough to maintain
correct clasification rate low texture variability across the image. The multi-
number of correctly classified cells resolution SAR (MRSAR) uses multiple neighborhood
= (1)
total number of cells sizes to accommodate complex texture types.
In the first generation IGDS prototype, the
MRSAR descriptor was computed for each region-
2.1.2. Algorithms used in the current IGDS of-interest (ROI) as described by Mao and Jain
The original IGDS scheme used a simplex strategy [40]. Three different neighborhood sizes (5 × 5,
[37] based on an eight nearest neighbor search [38] 7 × 7, and 9 × 9, see Fig. 1a) were simultane-
across the weighted-sum of distances. Weighting ously investigated to provide multi-scale texture
factors for shape, texture, and area, were tested information. At each examined nuclear pixel, the
on the new dataset. In order to gain better insight descriptor was estimated using pixel luminance (L* )
into the nature of the high-dimensional data set, value throughout a 21 × 21 window. By combining
additional experiments were conducted using joint- symmetric neighbor pixels in the estimation,
ordering (sum of rank) [6,39]. These studies demon- the resulting texture feature vector was 15-
strated a much more stable performance and re- dimensional (4 neighbor directions + 1 residual) × 3
vealed that area and texture feature measurements neighborhood sizes. The least square method was
contributed most to an optimized classification. used in the linear regression. A 15-dimensional
These observations led us to focus our efforts on mean vector and 15 × 15 covariance matrix of the
investigating the potentials of multivariate data fu- texture feature were then computed for the entire
sion for improving the discrimination performance ROI and stored for database query.
and robustness of the governing algorithms. In an attempt to improve the robustness of the
estimation while reducing the dimensionality of the
texture descriptor, the SAR model was modified
2.1.3. Region Simultaneous Autoregressive
using an approach that we refer to as region
(RSAR) model
SAR. This algorithm differs from the MRSAR model
The Simultaneous Autoregressive (SAR) model is
in two aspects. First, since the original MRSAR
a linear regressive model often used in imaging
implementation was designed with the purpose of
applications involving texture analysis. It defines
texture segmentation, estimates are generated in
the texture feature by computing SAR coefficients
relatively small local windows to maintain local
, the intersect , and the standard deviation of
information. Since the desired parameters are of
the residual ε over a specified area as
high-dimensionality (15), the estimation from a
set of 21 × 21 windows is not very robust from
I(x) = + (y)I(y) + ε(x) (2)
a statistical point of view. Another limitation of
y ∈N
the MRSAR implementation was that it positioned
in neighborhood N. Depending on data normaliza- the 21 × 21 windows with considerable overlap,
tion, can sometimes be considered 0 and omitted giving rise to a substantial amount of redundancy in
from the regression calculation. computing the estimation of local texture features.
SAR regression is usually performed over over- Since texture-based segmentation was not the goal
lapping neighborhoods centered at the pixel of of our application, we reasoned that it would not
interest. To achieve adequate texture analysis, the be necessary to estimate local texture features in
Fig. 1 Neighborhoods used in MRSAR and RSAR methods. (a) Neighborhood used in the MRSAR method. (b) Non-
symmetrical neighborhood used in the RSAR method. (c) Symmetrical neighborhood used in the RSAR method.
Image mining for investigative pathology using optimized feature extraction and data fusion 63
Fig. 2 L∗ hC∗ -based color filter training interface. Either the entire field or a subregion of the map image (upper-
left) can be used to generate appropriate L∗ hC∗ boundaries. Graphical boxes are shown in three corresponding two-
dimensional plot of the three-dimensional color space (upper-right), with the subsequent filtering result shown in the
lower-right quarter of the graphical interface.
66 W. Chen et al.
Fig. 3 Organization of the ground-truth PathMiner database showing the main tables and representative table fields.
The major entities are highlighted in gray while auxiliary tables are rendered in black and white. Please see text for
more specification.
Image mining for investigative pathology using optimized feature extraction and data fusion 67
set of auxiliary data, image formats, image Each digital report includes the diagnosis and
elements, image metrics, classifiers, and retrieval a brief description of the specimen as well as
methods. For each study, there may be several the gated cell percentage and six possible levels
categories or classes in which each specific category of fluorescence intensity for each protein marker
corresponds to a particular disorder or stage of within each cell population. To protect patient
disease. Each entry in the sample entity refers to an privacy, a database index number, which cannot be
actual physical specimen and its correlated clinical traced back to patient identity, is given to each
data that is stored in auxiliary data tables, e.g., cell report so that while patient identity is hidden from
surface protein expression profiles and molecular users, the correlated protein and/or molecular
studies. The image entity houses information characteristics of each case can be readily located
including the image name, a link to the image file, and retrieved. This interface is actively being
as well as the location of the robotic microscope utilized by individuals at two institutions (UPenn
stage at the time of acquisition. Additional and UMDNJ) to dynamically expand the ground-
information pertaining to equipment configurations truth databases.
as well as the microscopic magnification is stored A Perl-based program is under development to
in an auxiliary table as shown in Fig. 3. The automatically process flat-ASCII versions of the
image-element refers to region(s) or object(s) of correlated pathology report by collecting salient
interest within a given image. In the application fields (diagnosis, immunophenotype, molecular
reported in this paper, the image-elements are the diagnosis, cytochemistry results) while omitting all
individual lymphocytes. Each image may contain patient identifiers (Social Security Number, Medical
multiple image elements, and each element need Record Number, Case Accession Number, Address,
not necessarily be classified into the same category, Phone Number, etc.). The data and identifiers will
e.g., there may be normal lymphocytes present be developed to meet all HIPAA requirements for
in malignant lymphoma samples. The constituent sharing data anonymized for research[47].
entities of the database were designed to be
generalizable, modular, and portable thus providing
the underlying structure to support a wide range of
imaging applications. 4. Results
The graphical interface for the database pro-
4.1. Classification optimization
vides the means for conducting routine administra-
tive tasks such as creating accounts and assigning To validate the content-based image retrieval
privileges to users. The PathMiner database methods described in Section 2, two-fold cross
interface can be used to populate and manage validation was performed on a dataset composed
both local and remote databases. All users have of 202 normal, 265 MCL, 223 CLL, and 239 FCC
permission to search, index, and integrate any lymphocyte images.
databases that reside on their local computer, but The area feature measurement was the first to
such entries do not affect the contents of the be explored in our experiments. Since the area
ground-truth databases. measurement is one-dimensional, histograms of the
entire mixed test set of cells as well as that
of each individual cell class can be visualized as
3.1.4. Linking lymphocyte images to their shown in Fig. 4. The overall histogram is bimodal
because the MCL cells from the test set were
molecular characteristics
larger than the other three classes. The profiles for
In addition to archiving the digitized specimen and normal (benign) cells and those for the FCC class
its corresponding spatial and spectral profiles, the exhibited significant overlap, and thus classification
PathMiner system incorporates correlated clinical using area feature alone would not be optimal.
information into the database using customizable The bandwidth estimation for each class of cells
auxiliary data tables, thus providing a rich resource was computed as in (7) above, and was displayed
for investigative research. Specifically designed in Fig. 4 as a short bar above the mode area.
for this hematology application, a graphical user Improvements in the classification performance
interface has been developed to enable privileged using area feature alone are plotted in Fig. 7.
users to enter and retrieve flow cytometry reports To test the RSAR method, experiments were
(immunophenotype profiles) to and from the first conducted to determine the impact of varying
database. This information, along with the corre- kernel sizes (3 × 3, 5 × 5, 7 × 7, and 9 × 9), while
sponding image features, can then be statistically alternating between two different neighborhood
evaluated for patterns and inter-relationships. definitions (symmetric and non-symmetric). During
68 W. Chen et al.
Fig. 5 Classification using RSAR texture feature and fusion of RSAR texture with area. (a) Classification performance
using different window size and neighborhood settings. (b) For each neighborhood and window size, t value was
optimized to maximize fusion classification performance.
Image mining for investigative pathology using optimized feature extraction and data fusion 69
Fig. 8 Comparison of the LymphGate and human classification results in order to validate feasibility of utilizing the
LymphGate in discriminating lymphocytes from candidate cell images. Different color and shape of the markers indicate
human classification result for each cell image. The LymphGate used in this experiment was derived from empirical
data and shown in this illustration by the dashed lines. Cell images that fell inside the LymphGate were considered
candidate lymphocytes and were subject to subsequent steps of IA.
from non-lymphocytes, which themselves are flow reports. In the current stage of development
composed of several different cell groups. As a the imaged cells are reviewed by a certified
result, the distribution of the two-dimensional hematopathologist before they become part of
data, area, and roundness (as shown in Fig. 8), does the gold-standard databases. Technical highlights
not follow well-known linear or quadratic statistical of the IA system include the CAM module,
models. Second, the goal was to achieve maximal color filter, lymphocyte filter, and the image
sensitivity while maintaining a high specificity; this database.
adds yet another level of complexity if the problem Although our IGDS subsystem was designed as a
was to be adequately addressed with conventional diagnostic tool, it would have been inappropriate
statistical models. if we had used the IGDS algorithms both for
creating the ground-truth database and for testing
performance. As a result, a different strategy was
5. Discussion and future directions used in the IA subsystem for selective cell archiving.
The CAM module plays an important part
The chief objective of the PathMiner project in the IA system by providing the machine
was to expand upon the progress, which had equivalent of the ‘‘hand—eye coordination’’ that is
been achieved utilizing the first generation IGDS normally required to control a robotic system. The
prototype, and establish a large-scale, web-based CAM module automatically compensates for par-
resource, which could provide reliable decision centering and par-focal errors between objective
support for individuals evaluating ambiguous cases lenses during the pilot scan, thereby providing ac-
in hematopathology. Two approaches were utilized curate location of leukocytes for subsequent high-
in an attempt to attain this goal. One was to resolution imaging. The color filter and lymphocyte
implement the IA system, which facilitates popu- filter allow the IA system to selectively index
lating the ground-truth database by unsupervised lymphocytes from peripheral blood specimens. Al-
scanning and imaging the specimens. The other was though it is not impossible to refine the filters so as
to refine the CBIR algorithms to further improve to eliminate the small number of other leukocytes
performance and robustness. that yet remains in the candidate cell entries, these
The IA subsystem was shown to automatically cells can easily be picked out by pathologists in the
image, analyze, and archive hematopathology quality control stage of processing and therefore do
specimens while populating gold-standard not affect database integrity. The above activities
databases with resulting image metrics. The are coordinated by the IA subsystem in order to
IA also provides the means for entering correlated operate in unsupervised mode while automatically
Image mining for investigative pathology using optimized feature extraction and data fusion 71
microscopy images of pigmented skin lesions, Cytometry 37 [30] J. Li, J. Wang, G. Widerhold, IRG: integrated region
(1999) 255—266. matching for image retrieval, in: Proceedings of ACM
[15] M. Cenci, C. Nagar, A. Vecchione, PAPNET-assisted primary Multimedia, Los Angeles, 2000.
screening of conventional cervical smears, Anticancer. Res. [31] F. Schnorrenberg, C. Pattichis, C. Schizas, K. Kyriacou,
20 (2000) 3887—3889. Content-based retrieval of breast cancer biopsy slides,
[16] M.R. Kok, Y.T. van Der Schouw, M.E. Boon, D.E. Grobbee, Technol. Health Care 8 (2000) 291—297.
L.P. Kok, P.G. Schreiner-Kok, Y. van der Graaf, H. [32] C. Le Bozec, E. Zapletal, M. Jaulent, D. Heudes, P. Degoulet,
Doornewaard, J.G. van den Tweel, Neural network-based Towards content based image retrieval in a HIS-integrated
screening (NNS) in cervical cytology: no need for the PACS, in: Proceedings/AMIA Annual Symposium, 2000, pp.
light microscope? Diagn. Cytopathol. 24 (2001) 426— 477—481.
434. [33] M. Jaulent, A. Bennani, C. Le Bozec, E. Zapletal, P.
[17] P.H. Bartels, R.M. Montironi, D. Bostwick, J. Marshall, D. Degoulet, A customizable similarity measure betwen his-
Thompson, H.G. Bartels, D. Kelley, Karyometry of secretory tological cases, in: Proceedings/AMIA Annual Symposium,
cell nuclei in high-grade PIN lesions, Prostate 48 (2001) 2002, pp. 350—354.
144—155. [34] A. Wetzel, R. Crowley, S. Kim, R. Dawson, L. Zheng, Y. Joo,
[18] J.W. Smith Jr., J.R. Svirbely, C.A. Evans, P. Strohm, Y. Yagi, J. Gilbertson, C. Gadd, D. Deerfield, M. Becich,
J.R. Josephson, M. Tanner, RED: a red-cell antibody Evaluation of prostate tumor grades by content based image
identification expert module, J. Med. Syst. 9 (1985) retrieval, in: Proceedings of Spie—–the International Society
121—138. for Optical Engineering, vol. 3584, 1999, pp. 244—245.
[19] D.R. Thursh, F. Mabry, A.H. Levy, Computers and videodiscs [35] J. Wang, Pathfinder: multiresolution region-based searching
in pathology education: ECLIPS as an example of one of pathology images using IRM, in: Proceedings/AMIA Annual
approach, Hum. Pathol. 17 (1986) 216—218. Symposium, 2000, pp. 883—887.
[20] B.N. Nathwani, K. Clarke, T. Lincoln, C. Berard, C. Taylor, [36] M.E. Mattie, L. Staib, E. Stratmann, H.D. Tagare, J. Duncan,
K.C. Ng, R. Patil, M.C. Pike, S.P. Azen, Evaluation of an P.L. Miller, PathMaster: content-based cell image retrieval
expert system on lymph node pathology, Hum. Pathol. 28 using automated feature extraction, J. Am. Med. Inform.
(1997) 1097—1110. Assoc. 7 (2000) 404—415.
[21] M. Das, E. Riseman, B. Draper, FOCUS: searching for [37] W. Press, S. Teukolesky, W. Vetterling, B. Flannery,
multi-colored objects in a diverse imag database, in: IEEE Numerical Recipes in C: The Art of scientific computing,
Conference on Computer Vision and Pattern Recognition, Cambridge University Press, New York, 1992 (Chapter 10).
San Juan, Puerto Rico, 1997, pp. 756—761. [38] R. Duda, P. Hart, D. Stork, Pattern Classification, second
[22] M. Flickner, et al., Query by image and video content: the ed., 2001.
QBIC system, Computer 9 (1995) 23—31. [39] D. Comaniciu, P. Meer, D.J. Foran, Image-guided decision
[23] W. Ma, B. Manjunath, Texture-based pattern retrieval from support system for pathology, Mach. Vis. Appl. 11 (1999)
image databases, Multimedia Tools Appl. 1 (1996) 35—51. 213—224.
[24] M. Ortega, Y. Rui, K. Chakrabarti, S. Mehrotra, T. Huang, [40] J. Mao, A. Jain, Texture classification and segmentation
Supporting Similarity Quries in MARS, ACM Multimedia, using multiresolution simultaneous autoregressive models,
Seattle, Washington, 1997, pp. 403—4131. Pattern Recog. 25 (1992) 173—188.
[25] S. Antani, R. Kasturi, R. Jain, Pattern recognition methods [41] R. Bellman, Adaptive Control Processes: A Guided Tour,
in image and video databases: past, present, and future, in Princeton University Press, 1961.
advances in pattern recognition, in: A. Amin (Ed.), Lecture [42] M. Koeppen, The Curse of Dimensionality Fifth Online World
Notes in Computer Science, Springer-Verlag, London, UK, Conference on Soft Computing in Industrial Applications
1998, pp. 31—53. (WSC5) Held on the Internet, 2000.
[26] C. Faloutsos, R. Barber, M. Flickner, W. Hafner, W. Niblack, [43] H.L. Chen, I. Shimshomi, P. Meer, Model based object recog-
et al., Efficient and effective querying by image content, J. nition by robust information fusion, in: 17th International
Intell. Info. Sys: Integrated Artif. Intell. Database Technol. Conference on Pattern Recognition, Cambridge, UK, 2004.
3 (1994) 231—262. [44] T. Cover, J. Thomas, Elements of Information Theory, John
[27] A. Pentland, R. Picard, S. Sclaroff, Photobook: content Wiley, New York, 1991.
based manipulation of image databases, Int. J. Comp. Vis. [45] G. Wyszecki, W. Stiles, Color Science: Concepts and
18 (1996) 233—254. Methods, Quantitative Data and Formulai, second ed., John
[28] J. Wang, G. Wiederhold, O. Firschein, S. Wei, Wavelet- Wiley & Sons Inc., 1982.
based image indexing techniques with partial sketch [46] D. Comaniciu, P. Meer, Cell image segmentation in
retrieval capability, in: Proceedings of the IEEE Forum on diagnostic pathology, in: J. Suri, S. Singh, K. Setarehdam
Research and Technology Advances in Digital Libraries, ADL, (Eds.), Advanced Algorithmic Approaches to Medial Image
Washington, DC, 1997, pp. 13—24. Segmentation: State-of-The-Art Applications in Cardiology,
[29] C. Carson, M. Thomas, S. Belongie, J. Hellerstein, Neurology, Mammography and Pathology, Springer, 2001,
J. Malik, Blobworld: a system for region-based image pp. 541—558.
indexing and retrieval, Third International Conference on [47] J.J. Berman, Concept-match medical data scrubbing: how
Visual Information Systems, Amsterdam, the Netherlands, pathology text can be used in research, Arch. Pathol. Lab.
1999. Med. 127 (2003) 680—686.