Professional Documents
Culture Documents
ISSN 2229-5518
Keywords: Breast cancer, early detection, Tumor suppressor genes, BRCA, blood sample, PCR method, DNA
sequencing, gene sequence, Local sequence alignment algorithm, Smith waterman.
breast cancer susceptibility gene 2, respectively. The them into smaller pieces by inserting spaces in one
BRCA genes belong to a class of genes known as or the other so that identical subsequences are
tumor suppressor genes [10]. Like many other eventually aligned in a one-to-one correspondence
tumor suppressors, the protein produced from the naturally, spaces are not inserted in both sequences
BRCA genes helps prevent cells from growing and at the same position. The objective of sequence
dividing too rapidly or in an uncontrolled way. alignment is to match identical subsequences as far
There is no strong homology between BRCA1 and as possible. However, if the sequences are not
BRCA2, although both genes have a large exon 11 identical, mismatches are likely to occur as different
which seems to be crucial for function. However, letters are aligned together. The insertion of spaces
the function of the two genes seems to be similar produced gaps in the sequences. They are important
[14, 20]. The BRCA genes provides instructions to allow a good alignment between the characters of
for making a protein that is directly involved in sequences. A gap in the first sequence is considered an
repairing damaged DNA. By helping repair insertion of a character from the second sequence into
DNA, BRCA 1 plays a role in maintaining the the first one, whereas a gap in the second sequence is
stability of a cell's genetic information [13]. It considered a deletion of a character of the first
is identified that more than 1,000 mutations in the sequence.
both genes have a large exon 11 which seems to be
crucial for function. However, the function of the Once the alignment is produced, a score | can be
two genes seems to be similar [14, 20]. The assigned to each pair of aligned letters, called aligned
BRCA genes provides instructions for making a pair, according to a chosen scoring scheme. The
protein that is directly involved in repairing similarity of two sequences can be defined the
damaged DNA. By helping repair DNA, BRCA best score among all possible alignments between
plays a role in maintaining the stability of a cell's them. Sequence comparison is actually a well-
genetic information [13]. know problem in computer science.
Computational approaches to sequence alignment
It is identified that more than 1,000 generally fall into two categories: global
mutations in theBRCA1 gene and 800 mutations alignments and local alignments. Pair wise
in the BRCA2 gene are possible, many of which sequence alignment methods are used to find the
are associated with an increased risk of breast best-matching piecewise (local) or global
cancer. Most of these mutations lead to the j alignments of two query sequences. Pair wise
production of an abnormally short version of the alignments can only be used between two
BRCA1 protein, or prevent any protein from sequences at a time, but they are efficient to
being made from one copy of the gene. Other calculate and are often used for methods that do
BRCA1 mutations change single j protein building not require extreme precision (such as searching
blocks (amino acids) in the protein or delete large a database for sequences with high similarity to a
segments of DNA from the BRCA1 gene. Many query). The three primary methods of producing
BRCA2 mutations insert or delete a small number pair wise alignments are dot-matrix methods,
of DNA building blocks (nucleotides) in the gene. dynamic programming, and word methods.
Researchers believe that a defective or missing
BRCA1 protein is unable to help repair damaged Global alignment is achieved using the
DNA or fix mutations that occur in other |genes. Needleman-Wunsch algorithm. The algorithm it
As these defects accumulate, they can allow cells tries to take all of one sequence and align it with
to | grow and divide uncontrollably and form a tumor all of a second sequence. Short and highly
[8, 9]. similar subsequences may be missed in the
alignment because they are outweighed by the rest
4. SEQUENCE COMPARISON of the sequence. Hence, one would like to create
a locally optimal alignment [18]. Local
Sequence comparison can be defined as the alignments are more useful for dissimilar
problem of J finding which parts of the sequences
sequences that are suspected to contain regions of
are similar and which parts are different. Generally,
similarity or similar sequence motifs within their
a measure of how similar they are is also desirable. A
typical approach to solve this problem is to find a larger sequence context. The Smith-Waterman
good and plausible alignment between the two algorithm is a general local alignment method also
sequences. Then, given an appropriate scoring based on dynamic programming. The dynamic
scheme, their similarity can be computed. Generally, programming approach to pair wise sequence
sequence comparisons involve aligning sections of alignment is guaranteed to provide the optimal
the two sequences in a way that exposes the global or local pair wise alignment and score given
similarities between them [7]. The idea of a particular scoring scheme [1]. In smith waterman
aligning two sequences (of possibly different algorithm,
sizes) is to write one on top of the other, and break 1. All symbols (residues) in the two
IJSER © 2010
http://www.ijser.org
International Journal of Scientific & Engineering Research, Volume 1, Issue 2, November-2010 3
ISSN 2229-5518
5. PROPOSED SYSTEM
IJSER © 2010
http://www.ijser.org
International Journal of Scientific & Engineering Research, Volume 1, Issue 2, November-2010 4
ISSN 2229-5518
7. CONCLUSION
8. FUTURE SCOPE
IJSER © 2010
http://www.ijser.org
International Journal of Scientific & Engineering Research, Volume 1, Issue 2, November-2010 5
ISSN 2229-5518
smith waterman algorithm. The future work can [17]. "Smith Waterman algorithm" Oct. 4, 2007
target to use the upgraded Smith waterman [18]. Wikipedia, "Smith Waterman algorithm,"
algorithm, that has reduced computational WikipediaH April 2010.
complexity to (N*(M+l)/2) and less size and [20]. Wisegeek, "What is a tumor suppressor gene
space complexity. Moreover, risk level of cancer wisegeek, 2009.
can also be identified in further computational
analysis.
ABBREVATIONS
REFERENCES
[1]. EC Rouchka "Aligning DNA sequencing using
Dynamic Programming",ACM, 2006.
[2]. American Cancer Society, "Breast Cancer
Facts and Figures 2009-2010", American Cancer
Society, 2009.
[3]. American Cancer Society, "What is Breast
Cancer", American Cancer Society, Sep. 18, 2009.
[4]. Baylor college of Medicine HGSC, "Smith
waterman algorithm," Baylor college of Medicine
HGSC, Aug.01, 2002.
[5]. Breast Cancer, "Stages of Breast Cancer",
Breast Cancer, Jan.21, 2010.
[6]. David W Mount, Bioinformatics: Sequence
and genome analysis, 2nd ed, NY: Cold spring
horbor laboratory press, 2000.
[7]. Eugene W. Myers, "An Overview of Sequence
Comparison Algorithms in Molecular Biology,"
Department of Computer Science, The University
of Arizona, Arizona, Tech Rep 91-29, December
20,1991.
[8]. Genetic Home Reference, "BRCA1", Genetic
Home Reference, Aug, 2007
[9]. Genetic Home Reference, "BRCA2", Genetic
Home I Reference, Aug, 2007.
[10]. National Cancer Institute, "BRCA I and BRCA2:
Canarl Risk and Genetic Testing" National Cancer
Institute, May.29, 2009.
[11]. "Overview of steps in DNA Sequencing".
[Online]. .Apr.6 2010.
[12]. P. Sharma et al, "Early detection of breast
cancer base on gene-expression patterns in
peripheral blood cells," Breast cancer research, p.
634+, Jun 2005.
[13]. Ralph Scully, "Role of BRCA gene dysfunction
in breast and ovarian cancer predisposition," Breast
Cancer research, July 2000.
[15]. S. A. de Carvalho Junior," Sequence
Alignment Algorithms," M.S. thesis, King.s College
London, University for London, London, September
2003.
[16]. S. Das and D.Dey, "A new algorithm for
localalignment in DNA sequencing, "in IEEE India
National conference, 2004, pp 410-413.
IJSER © 2010
http://www.ijser.org