You are on page 1of 1

Lab assignment 1: Data Science (IT258M)

1. Using the dataset of emails:


https://www.kaggle.com/venky73/spam-mails-dataset
(You need to register in Kaggle to download the data), implement a Naive Bayes-based
spam-ham (ham: is non-spam email) filter.

2. Plot the confusion matrix, analyze the results, and justify the reasoning behind these
results.

You might also like