Professional Documents
Culture Documents
Structured
Data
Numerical / Quantitative
Structured Discrete
Continue
Relation : ?
Positive Correlation
Negative Correlation
Inno_DS_11.30 Page 1
Types of Negative Correlation
Inno_DS_11.30 Page 2
EMPID EMPNAME EMPAGE EMPEXP EMPTECH EMPSAL
Regression
Relationship between independent & Dependent variable is
called regression
Inno_DS_11.30 Page 3
Best fit line : A line which passes through maximum data point
and very close to other data point at the same time. Is called best
fit line
Data Acquisition
Feature Selection
Inno_DS_11.30 Page 4
Feature Selection
Error Detection
Mean Std
Zscore
Percentile
IQR
Encoding
Nominal Categorical
Get Dummies
Ordinal Categorical
Ordinal Encoder
Map- lambda
Apply - lambda
Label Encoder
Imbalance
Data Separation
Data Splitting
Model Building
Training model
Evaluation Model
Prediction
Inno_DS_11.30 Page 5