Professional Documents
Culture Documents
NLP Midexam Summer2020
NLP Midexam Summer2020
Question 2: Write pseudocode for the K-Nearest neighbors classifier. [10 Marks]
Question 3: Convert the following dataset into frequency based representation. [10 Marks]
<s> Cricket Test Pakistan </s>
<s> England Broad Wickets</s>
<s> Test Wickets Cricket </s>
Question 4: Perform K-means clustering on a dataset having 5 documents. Keep the value of k = 2. Perform
at least 2 iterations. [10 Marks]
X Y
D1 3 1
D2 4 2
D3 6 4
D4 5 2
D5 2 3