of Machine Learning
Tasnim Jahan Chowdhury
(ID 2014-1-60-006)
The sinking of the Titanic is one of the most infamous shipwrecks in history. The tragedy killed
1502 of the 2224 passengers and crew, and left many wondering what could have been done better.
One of the most important reasons for the loss of life is that there were not enough lifeboats, and
although a fair amount of luck was probably involved, some groups of people were more likely to
survive than others.
Background
Proposed Method
[Figure: flowchart of the proposed method. Box labels: Data Set; Pre-Processing; Feature Engineering; Processed Data; Training Data; Testing Data; Apply Machine Learning Technique (Algorithm A, Algorithm B, Algorithm C); Prediction; Best Accuracy.]
Data Set
• We collected the dataset from the Kaggle website. It has two
subsets: training data and testing data. The training set
contains 12 columns, which are the features, and 891
rows, which are the data points. The testing set
covers 418 passengers and has all of these columns except
“Survived”, because that is the value we predict with the algorithms.
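
The split between features and the target column can be sketched in a few lines. This is only an illustration, not the code used in the project: the column names follow the public Kaggle Titanic training file, the three rows are made-up placeholder values, and `split_features_and_target` is a hypothetical helper name.

```python
import csv
import io

# Hypothetical three-row excerpt in the layout of the Kaggle training file.
# The real training file has 891 rows; these values are illustrative only.
TRAIN_SAMPLE = """PassengerId,Survived,Pclass,Name,Sex,Age,SibSp,Parch,Ticket,Fare,Cabin,Embarked
1,0,3,Passenger A,male,22,1,0,T1,7.25,,S
2,1,1,Passenger B,female,38,1,0,T2,71.28,C85,C
3,1,3,Passenger C,female,26,0,0,T3,7.92,,S
"""


def split_features_and_target(csv_text, target="Survived"):
    """Return (feature_rows, labels) with the target column removed from each row."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    # pop() removes the target from the row, so the model never sees it as input.
    labels = [int(row.pop(target)) for row in rows]
    return rows, labels


features, labels = split_features_and_target(TRAIN_SAMPLE)
print(len(features[0]), "feature columns;", "labels:", labels)
```

The testing file has no “Survived” column, so there the rows would be used as features directly and the labels would come from the model's predictions.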
Attribute Information
Survived
Survived is the target variable
Visualize Normalization
• Other cleaning
Model Creation
Statistical Analysis of Dataset
Comparison
Algorithm    Accuracy
SVM          83.5%
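
The slides do not say how accuracy was computed. Assuming the usual definition (the fraction of passengers whose predicted survival matches the true label), a minimal sketch looks like this. The label lists are hypothetical and are not results from the project.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("cannot compute accuracy on empty input")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


# Hypothetical labels for eight passengers (1 = survived, 0 = did not survive).
y_true = [1, 0, 0, 1, 1, 0, 0, 1]
y_pred = [1, 0, 1, 1, 0, 0, 0, 1]

print(f"{accuracy(y_true, y_pred):.1%}")  # 6 of 8 correct -> 75.0%
```

Under this definition, a reported accuracy of 83.5% would mean that roughly 835 of every 1000 evaluated passengers were classified correctly.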
Comparison with other paper
• Many other researchers have worked on this problem to identify the
factors behind the survival of some passengers. We use various
combinations of features and different machine learning
methods, and try to achieve better results and accuracy.
Lam and Tang Our project
• In this paper, we have done our analysis with only a few datasets. In the near future we will try to
analyze a larger number of datasets. It would also be interesting to explore the
dataset further and introduce more attributes, which might lead to better results.
Thank You to our
honourable faculties.