Similarity and homology
Pairwise sequence alignment
Dot matrices
ORFs
The substitution matrix
Gapless alignment
Dot matrix of AAGACGTTTA against GACGTACT (a dot marks each pair of identical characters).
All diagonals with at least 4 out of 5 matches.
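The filter described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names `dot_matrix` and `filter_diagonals` and the parameters `window` and `min_matches` are made up for this sketch, and "4 out of 5" is read as a sliding window of 5 positions along each diagonal.

```python
def dot_matrix(seq1, seq2):
    """Return a 0/1 matrix: rows follow seq1, columns follow seq2."""
    return [[1 if a == b else 0 for b in seq2] for a in seq1]


def filter_diagonals(matrix, window=5, min_matches=4):
    """Keep a dot only if it sits in a diagonal window with enough matches."""
    n_rows, n_cols = len(matrix), len(matrix[0])
    kept = [[0] * n_cols for _ in range(n_rows)]
    for i in range(n_rows - window + 1):
        for j in range(n_cols - window + 1):
            # Count matches along the diagonal window starting at box (i, j).
            matches = sum(matrix[i + k][j + k] for k in range(window))
            if matches >= min_matches:
                for k in range(window):
                    if matrix[i + k][j + k]:
                        kept[i + k][j + k] = 1
    return kept


seq1 = "AAGACGTTTA"
seq2 = "GACGTACT"
filtered = filter_diagonals(dot_matrix(seq1, seq2))

# Print the filtered plot: '#' for a kept dot, '.' for an empty box.
for base, row in zip(seq1, filtered):
    print(base, " ".join("#" if cell else "." for cell in row))
```

With these two sequences, the only dots that survive lie on the diagonal where GACGT in the first sequence lines up with GACGT in the second.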
In class exercise:
Finding genes
The DNA sequence alone does not tell us where the genes are. (Genes are segments of DNA that are transcribed into RNA.) However, it does tell us where the open reading frames (ORFs) are. We can find ORFs by looking for regions that contain no STOP codons. If we think the region is a protein-coding gene, we can also look for the translation start codon, ATG.
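A minimal sketch of this search, assuming the standard STOP codons TAA, TAG and TGA and scanning only the three forward reading frames (the reverse strand is left out for brevity). The function names, the `min_codons` cutoff and the test sequence are hypothetical choices for illustration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def open_reading_frames(dna, min_codons=2):
    """Return (frame, start, end) for each stretch of codons with no STOP.

    Positions are 0-based; end is exclusive and excludes the STOP codon.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        run_start = frame
        last = frame  # position just past the last complete codon seen
        for pos in range(frame, len(dna) - 2, 3):
            if dna[pos:pos + 3] in STOP_CODONS:
                if (pos - run_start) // 3 >= min_codons:
                    orfs.append((frame, run_start, pos))
                run_start = pos + 3
            last = pos + 3
        # Close the final stretch if the sequence ended without a STOP.
        if (last - run_start) // 3 >= min_codons:
            orfs.append((frame, run_start, last))
    return orfs


def first_start_codon(dna, start, end):
    """Return the position of the first in-frame ATG in [start, end), or None."""
    for pos in range(start, end - 2, 3):
        if dna[pos:pos + 3].upper() == "ATG":
            return pos
    return None


dna = "CCATGAAATTTGGGTAACC"  # made-up example sequence
orfs = open_reading_frames(dna)
for frame, start, end in orfs:
    print(frame, start, end, first_start_codon(dna, start, end))
```

Note that an open stretch is reported whether or not it contains an ATG; the start codon is looked up separately, mirroring the two steps in the text.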
In class exercise:
Alignment matrix
To prepare an alignment, we first consider the score for aligning any one character of the first sequence with one character of the other sequence (one association, one match).
    G  A  C  G  T  A  C  T
A   0  1  0  0  0  1  0  0
A   0  1  0  0  0  1  0  0
G   1  0  0  1  0  0  0  0
A   0  1  0  0  0  1  0  0
C   0  0  1  0  0  0  1  0
G   1  0  0  1  0  0  0  0
T   0  0  0  0  1  0  0  1
T   0  0  0  0  1  0  0  1
T   0  0  0  0  1  0  0  1
A   0  1  0  0  0  1  0  0
Conservative mutations
DNA: A change in the 3rd base of a codon, and occasionally in the 1st base, can leave the encoded amino acid unchanged.
Protein: A change between amino acids in the same chemical class conserves the chemical environment. For example, Lys to Arg is conservative because both are positively charged.
[Figure: chemical structures of amino acid side chains, with positive charges marked (+).]
If the chemistry of the sidechain is conserved, then the mutation is less likely to change structure/function.
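As a rough illustration, the idea of "same chemical class" can be written as a lookup. The grouping below is one common textbook classification and is an assumption of this sketch, not something fixed by the slides; other sources place residues such as His, Cys, Tyr or Gly differently. The names `CLASSES`, `CLASS_OF` and `is_conservative` are hypothetical.

```python
# Assumed grouping of the 20 amino acids (one-letter codes) by side-chain chemistry.
CLASSES = {
    "non-polar": "AVLIMFWPG",
    "polar": "STCYNQ",
    "positive": "KRH",
    "negative": "DE",
}

# Invert the table: amino acid -> class name.
CLASS_OF = {aa: name for name, members in CLASSES.items() for aa in members}


def is_conservative(a, b):
    """True if both amino acids fall in the same assumed chemical class."""
    return CLASS_OF[a] == CLASS_OF[b]


print(is_conservative("K", "R"))  # Lys to Arg: both positively charged
print(is_conservative("K", "E"))  # Lys to Glu: opposite charges
```

A substitution matrix such as BLOSUM62 captures the same intuition with graded scores rather than a yes/no answer.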
Amino acid classes: non-polar, polar, polar/charged.
Each number is the score for aligning a single pair of amino acids.
What is the score for this alignment?
ACEPGAA
ASDDGTV
BLOSUM62
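Scoring a gapless alignment is just a sum of per-column lookups. The sketch below shows this for the two sequences above; `BLOSUM62_SUBSET` holds only the handful of BLOSUM62 entries these columns need (a full matrix has an entry for every pair), and the helper names are hypothetical.

```python
# Only the BLOSUM62 entries needed for this one alignment.
BLOSUM62_SUBSET = {
    ("A", "A"): 4, ("C", "S"): -1, ("D", "E"): 2, ("D", "P"): -1,
    ("G", "G"): 6, ("A", "T"): 0, ("A", "V"): 0,
}


def pair_score(a, b, matrix=BLOSUM62_SUBSET):
    """Look up a pair score; the matrix is symmetric, so try both orders."""
    key = (a, b) if (a, b) in matrix else (b, a)
    return matrix[key]


def alignment_score(seq1, seq2):
    """Sum the pair scores column by column (no gaps allowed)."""
    if len(seq1) != len(seq2):
        raise ValueError("a gapless alignment needs sequences of equal length")
    return sum(pair_score(a, b) for a, b in zip(seq1, seq2))


total = alignment_score("ACEPGAA", "ASDDGTV")
print(total)
```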
A teacher's dilemma
To understand...               You first need to know...
Multiple sequence alignment    Substitution matrices
Substitution matrices          Phylogenetic trees
Phylogenetic trees             Multiple sequence alignment
Each diagonal represents a different alignment, whose score is the sum of the boxes.
In class exercise:
Gapless Alignment
Y K K G E R
G D I
Fill in each box using BLOSUM62. Score all diagonals.
K R
still time?
(2) Loop over boxes (nested loops). Put the appropriate BLOSUM number in the box.
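The nested-loop step, followed by scoring every diagonal, might look like the sketch below. It is a minimal illustration, not the exercise solution: the short sequences GKR and KR are made up, `BLOSUM62_SUBSET` contains only the BLOSUM62 entries those residues need, and the function names are hypothetical.

```python
# Only the BLOSUM62 entries needed for the residues G, K and R.
BLOSUM62_SUBSET = {
    ("G", "G"): 6, ("G", "K"): -2, ("G", "R"): -2,
    ("K", "K"): 5, ("K", "R"): 2, ("R", "R"): 5,
}


def pair_score(a, b):
    """Look up a pair score; the matrix is symmetric, so try both orders."""
    key = (a, b) if (a, b) in BLOSUM62_SUBSET else (b, a)
    return BLOSUM62_SUBSET[key]


def fill_matrix(seq1, seq2):
    """Nested loops: one box per (character of seq1, character of seq2)."""
    matrix = []
    for a in seq1:
        row = []
        for b in seq2:
            row.append(pair_score(a, b))
        matrix.append(row)
    return matrix


def diagonal_scores(matrix):
    """Sum the boxes on each diagonal, keyed by offset (column minus row)."""
    totals = {}
    for i, row in enumerate(matrix):
        for j, value in enumerate(row):
            totals[j - i] = totals.get(j - i, 0) + value
    return totals


matrix = fill_matrix("GKR", "KR")
scores = diagonal_scores(matrix)
best_offset = max(scores, key=scores.get)
print(scores, best_offset)
```

Each offset corresponds to sliding one sequence along the other by that many positions, so the best-scoring diagonal is the best gapless alignment.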