Professional Documents
Culture Documents
With the advent of advanced computing techniques, FISH has the potential to transform the field of clinical diagnostics
through computational modelling and image pattern analysis.
1
A single-stranded DNA molecule made up of fluorophore-bound nucleotides.
2
A gene that regulates normal cell growth and as such, can cause cancer (uncontrolled cell growth) when mutated.
3
A promotor i.e. a specific DNA sequence that initiates gene expression.
An AI accomplishes its intended goal using a set of instructions called algorithms. AI algorithms can process and
subsequently learn from data in a number of ways. If the system is trained on a dataset that has a known correct output, this
is called supervised learning; if it is trained on data alone without human intervention, it is known as unsupervised learning
[ CITATION Nic19 \l 1033 ].
Another method of ‘teaching’ the AI system is through reinforcement learning where the system figures out how to make a
sequence of decisions that ultimately result in accomplishing a goal in a completely unknown environment. Reinforcement
occurs through a system of rewards and penalties that encourage the AI algorithm to learn through a succession of trial and
error.
A set of AI algorithms working together are known as models. Machine Learning (ML) models involve a wide array of
algorithms that apply statistical techniques to a large dataset with unique data points. A ML system is designed to
automatically improve its analytical ability with every analysis. Another model is known as deep learning and it mimics the
way a human neuron processes information to identify and then amplify recognizable patterns in a large dataset. This is
called a neural network and it involves processing data through a network of interconnected nodes that provide a final
result[ CITATION Pan18 \l 1033 ].
A combination of these models and techniques has allowed AI algorithms to accurately predict outcomes when given
enough quantifiable data – sometimes with nearly human accuracy. This is invaluable in multiple fields, especially clinical
diagnostics and medical analytics.
Big Data
Big Data is a term used to describe immense datasets with multiple sources that often keeps adding data in real time.
Conventional databases and data processing softwares are unable to handle the ever-expanding volume of Big Data
[ CITATION IBM \l 1033 ]. Consequently, novel and advanced analytical methods are needed to identify patterns,
comprehend trends and predict outcomes. These include machine learning, deep learning, predictive analytics, natural
language processing and data mining models that involve linear regression, naïve Bayes, kNN[ CITATION WuX08 \l
1033 ][ CITATION XWu14 \l 1033 ].
However, Ostromohov et al. (2018) recently presented a procedure that can measure the real time hybridization rate for any
probe and target sequence by constantly delivering probes to analyzed cells and siphoning away unhybridized probes
immediately. With this newfound ability to quantify hybridization conditions such as probe concentration, sequence, length
and types; volume exclusion and destabilizing agents content; ionic strength and rate-enhancing agents, more accurate
computational models can be developed that will ultimately lead to more efficient probe designs and optimized FISH
assays[CITATION Nad18 \l 1033 ].
Another pertinent development is the emergence of thermodynamic models that increasingly predict highly accurate
temperatures for the denaturation of the DNA double helix (known as melting), a critical value for molecular techniques like
FISH, Polymerase Chain Reaction (PCR), microarrays etc. For FISH however, multiple chemical reagents are used, and
different parameters need to be considered than needed for PCR or other molecular biology techniques. So it becomes
imperative to modify existing DNA/RNA based thermodynamic models. Melting temperature is also not always the optimal
hybridization temperature, which is why there is a great of deal of customization that is still required to optimize current
thermodynamic models for FISH assays [ CITATION Fon15 \l 1033 ].
Automated FISH Analysis
Biological pattern recognition combines machine learning with optical biological assays or biomedical imaging to provide
reliable outcomes. The eventual goal is to reach consistently accurate high-throughput imaging analysis on par with or even
exceeding human accuracy. This can be accomplished through both supervised machine learning where the system is
trained on examples or unsupervised learning where the system learns to cluster relevant data using pre-set rules
[ CITATION Sha10 \l 1033 ][ CITATION Che18 \l 1033 ].
Automated cytogenetic analysis techniques have been in use since 1980’s, mainly for karyotyping. A system enhanced
images, segmented and aligned chromosomes and compared feature vectors in the images with stored feature vectors of
standard chromosomes. This allowed the automatic diagnosis of abnormal chromosomal patterns and distributions
[ CITATION Bri07 \l 1033 ].
Modern image processing and pattern recognition techniques have a come a long way from those rudimentary beginnings
and have evolved into a variety of applications. Lerner et al. (2001) have used a neural network classifier to optimize and
sharpen FISH images by improving discrimination between the actual fluorescence of the hybridized probes and the
reflected or background fluorescence scattered by the 3-dimensional cell debris [ CITATION Ler01 \l 1033 ]. Nandi et al.
(2009) have automated nuclei segmentation, the first (and usually the longest) step of analyzing tissue images. They used a
hybrid, data-driven segmentation algorithm in concert with intelligent, supervised pattern classification system which learns
by collating features of manually segmented nuclei as well as the outcomes of multiple classifiers [ CITATION Nan09 \l
1033 ]. Perhaps the most promising results is the successful application of deep learning (through a convolutional neural
network) for high-throughput image processing of DNA FISH signals by Gudla et al (2017). The workflow, dubbed
‘SpotLearn’ has proven accuracy even for images with low signal and high noise ratio [CITATION Gud17 \l 1033 ].
These developments in advanced computing techniques applied to biological pattern recognition are important for not only
clinical diagnosis, but also risk prediction. Wang et al. (2010) reported a 92-98% agreement between a trained
cytogeneticist and automated FISH analysis system for early stage cervical cancer diagnosis [ CITATION Wan10 \l 1033 ].
A more recent study that used deep learning to identify HER-2 gene amplification in breast cancer reported accuracy
validated by 3 independent pathologist and a set of 57 reference images [ CITATION Zak19 \l 1033 ]. While further work
on pattern recognition and image processing still needs to be done, the future of automated FISH analytics has proven to be
promising.
References
[2] L. Kearney, "The impact of the new FISH technologies on the cytogenetics of haematological malignancies," British
Journal of Haematology, pp. 648-658, 2001.
[3] J. H. C. H. &. B. M. Nichols, "Machine learning: applications of artificial intelligence to imaging and diagnosis,"
Biophysical Reviews, p. 111–118 , 2019.
[4] T. S. P. &. A. R. Panch, "Artificial intelligence, machine learning and health systems.," Journal of global health,
2018.
[6] X. K. V. R. Q. J. Wu, "Top 10 algorithms in data mining," Knowledge and Information Systems, pp. 1-37, 2008.
[7] X. Z. G. W. a. W. D. X. Wu, "Data mining with big data," IEEE Transactions on Knowledge and Data Engineering,
pp. 97-107, 2014.
[8] N. Ostromohov, D. Huber, M. Bercovici and G. V. Kaigala, "Real-Time Monitoring of Fluorescence in situ
Hybridization (FISH) Kinetics," Analytical Chemistry, pp. 11470-11477, 2018.
[10] L. Shamir, J. D. Delaney, N. Orlov, D. Eckley, Mark and I. G. Goldberg, "Pattern Recognition Software and
Techniques for Biological Image Analysis," 2010.
[11] C. J. H. T. M. e. a. Chen, "Unsupervised Learning and Pattern Recognition of Biological Data Structures with Density
Functional Theory and Machine Learning.," Nature Scientific Reports, p. 557, 2018.
[12] A. P. a. R. G. Britto, "A Review of Cytogenetics and its Automation," Journal of Medical Sciences, pp. 1-18, 2007.
[13] B. &. C. W. &. D. S. &. H. M. &. B. C. Lerner, "Automatic signal classification in fluorescence in situ hybridization
images.," Cytometry, 2001.
[14] G. P. M. K. M. T. L. S. Nandy K, "Automatic nuclei segmentation and spatial FISH analysis for cancer detection.,"
Conf Proc IEEE Eng Med Biol Soc., 2009.
[15] P. R. N. K. P. G. &. M. T. Gudla, "SpotLearn: Convolutional Neural Network for Detection of Fluorescence In Situ
Hybridization (FISH) Signals in High-Throughput Imaging Approaches," Cold Spring Harbor Symposia on
Quantitaive Biology, 2017.
[16] Z. B. Z. R. e. a. Wang X, "Automated analysis of fluorescent in situ hybridization (FISH) labeled genetic biomarkers
in assisting cervical cancer diagnosis," Technol Cancer Res Treat. , pp. 231-242, 2010.
[17] F. d. B. W. W. M. e. a. Zakrzewski, "Automated detection of the HER2 gene amplification status in Fluorescence in
situ hybridization images for the diagnostics of cancer tissues.," Sci Rep , 2019.