You are on page 1of 1

IT 191T/284T – Intelligent Big Data

Project 2

Machine Learning Prediction


You have learned and practiced extracting useful information from big dataset using regular
tools such as Numpy, Pandas. But you could be able to make predictions only if you apply
Machine Learning (Artificial Intelligence).
Please build Machine Learning models to predict the survival of Titanic passengers. Try different
models as many as you could: Stochastic Gradient Descent (SGD), Logistic Regression, k-
Nearest Neighbor, Gaussian Naive Bayes, Support Vector Machine, Decision Tree,
Random Forest, and compare the results. You will need to use almost all you have learned from
this course: Python, NumPy, Pandas, MatPlotLib, Seaborn, Scikit-Learn.
This is a “practice” project. You are not expected to “create” anything. The analysis procedure,
machine learning model, and Python scripts can be found from this artical, and the link to the
dataset is found there as well. Please spend time to get yourself fully practiced. This is the best
preparation for completing your Final Projects.
Write your Project 2 report to record what you have tried and done (Python code plus your
understanding and detailed explanations as comments).
Have Fun!

DR. DAMING ZHANG, DEPT OF IND TECH, FRESNO STATE U 1

You might also like