You are on page 1of 2

CS 460 INTRODUCTION TO BIOINFORMATICS

Assignment 3
Due Date: 8th November 2019, 10am
Copy cases will result in an automatic ‘F’ grade and will be reported to campus disciplinary
committee. You can consult with others and the internet but cannot copy/paste the entire text.
Properly reference the source that you consulted (name/roll no of student, web address of the site).

Q1. Given the values {2341, 4234, 2839, 430, 22, 397, 3920}, a hash table of size 7, and hash
function h(x) = x mod 7, show the resulting tables after inserting the values in the given order.
Solve the collision with separate chaining. [5 marks]

Q2. Run the k-means clustering on the following data. Use k =2. The initial cluster centers are the
first 2 data points. [10 marks]
1.3, 1.1
1.3, 0.2
0.6, 2.8
3.0, 3.2
1.2, 0.7
1.4, 1.6
1.2, 1.0
1.2, 1.1
0.6, 1.5
1.8, 2.6
1.2, 1.3
1.2, 1.0
0.0, 1.9

Q3. Assume we have found a mutation Try178Gly in multiple aligned sequences of a protein. To
check whether this mutation is pathogenic or neutral what analysis would you run. How do you
think this will affect the structure and function of protein. Explain briefly [5 marks]

You might also like