Professional Documents
Culture Documents
BIOINFORMATICS - ASSIGNMENT 3
c. Change the default gap open and extend from “10/0.5” to “5/0.1” and run the
local alignment. Report identity, similarity & gaps (ratio & percent), then
compare the results with previous local alignment (gap opening & gap extension
based on the alignment output of each set-up).
Gap opening: 10 → 5
Gap extension: 0.5 → 0.11
Identity ((ratio (percent)): 73/206 (35.4%)
Similarity ((ratio (percent)): 107/206 (51.9%)
Gaps ((ratio (percent)): 45/206 (21.8%)
Adjusted parameters
- Gaps: There are more gap openings in the result of adjusted parameters than
the result of default parameters. In general, the results of adjusted parameters
have larger gap extensions than the default parameters.
Question 2: In the NCBI database, retrieve the protein sequences for human
(NP_000509),
Pan troglodytes (XP_508242), Canis familiaris (NP_001257813), Mus musculus
(NP_058652), Gallus gallus (NP_990820). Perform multiple sequence alignment for
these sequences (https://www.ebi.ac.uk/Tools/msa/clustalo/) and investigate the first
60 residues of the alignment. Answer the following questions:
d. Which species have 100% identity when comparing the first 60 residues?