
Section Ten: The SKlearn Library

A. Data Preparation
1. Data files from SKlearn
2. Data cleaning
3. Metrics module
4. Feature selection
5. Data Scaling
6. Data Split

B. ML Algorithms
1. Linear Regression
2. Logistic Regression
3. Neural Network
4. SVR
5. SVC
6. K-means
7. PCA
8. Decision Tree
9. Ensemble Regression
10. Ensemble Classifier
11. K Nearest Neighbors
12. Naïve Bayes
13. LDA, QDA
14. Hierarchical Clusters
15. DBSCAN
16. NLP
17. Apriori

C. Algorithm Evaluation:
1. Model Check
2. Grid Search
3. Pipeline
4. Model Save

D. Time Series

1.2) Cleaning Data
Cleaning is done by filling in the missing entries, whether they are marked as nan, zero, or some other value. The general form is:

# Import Libraries

from sklearn.impute import SimpleImputer


import numpy as np
#----------------------------------------------------

# Cleaning data
'''
impute.SimpleImputer(missing_values=nan, strategy='mean', fill_value=None, copy=True)
'''
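# X is assumed to be a 2D numeric array that was loaded beforehand
ImputedModule = SimpleImputer(missing_values = np.nan, strategy ='mean')
# fit() learns the per-column fill values and returns the fitted imputer itself
ImputedX = ImputedModule.fit(X)
# transform() replaces the missing entries in X with the learned values
X = ImputedX.transform(X)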

#X Data
print('X Data is \n' , X[:10])

#y Data
print('y Data is \n' , y[:10])

The value of strategy can be one of: mean, median, most_frequent, constant
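To make the four strategy values concrete, here is a minimal sketch that fills the same missing entry with each of them. The toy column and the fill value of -1.0 are assumptions chosen only for illustration; they do not come from the original material.

```python
import numpy as np
from sklearn.impute import SimpleImputer

# Toy single-column data with one missing entry in the last row (illustrative values)
data = np.array([[1.0], [2.0], [2.0], [7.0], [np.nan]])

results = {}
for strategy in ("mean", "median", "most_frequent"):
    imputer = SimpleImputer(missing_values=np.nan, strategy=strategy)
    # Keep only the value that was written into the missing cell
    results[strategy] = float(imputer.fit_transform(data)[-1, 0])

# 'constant' ignores the data and writes fill_value instead (assumed here to be -1.0)
imputer = SimpleImputer(missing_values=np.nan, strategy="constant", fill_value=-1.0)
results["constant"] = float(imputer.fit_transform(data)[-1, 0])

print(results)
```

With this column, mean gives 3.0, median and most_frequent both give 2.0, and constant gives -1.0. Note that most_frequent and constant also work on non-numeric columns, while mean and median need numeric data.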

1.2.2) Example
# Import Libraries
from sklearn.datasets import load_breast_cancer
from sklearn.impute import SimpleImputer
import numpy as np
#----------------------------------------------------

#load breast cancer data


BreastData = load_breast_cancer()

#X Data
X = BreastData.data

#y Data
y = BreastData.target
#----------------------------------------------------
# Cleaning data

'''
impute.SimpleImputer(missing_values=nan, strategy='mean', fill_value=None, copy=True)
'''
ImputedModule = SimpleImputer(missing_values = np.nan, strategy ='mean')
ImputedX = ImputedModule.fit(X)
X = ImputedX.transform(X)

#X Data
print('X Data is \n' , X[:10])

#y Data
print('y Data is \n' , y[:10])
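The breast cancer dataset shipped with scikit-learn contains no missing values, so the imputer in the example above leaves X unchanged. The following sketch, under the assumption that we hide about 5% of the entries at random ourselves, shows the imputer actually doing work. The names X_missing, X_filled and mask, and the 5% rate, are hypothetical choices for this illustration.

```python
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.impute import SimpleImputer

X = load_breast_cancer().data
print('Missing values in original X:', np.isnan(X).sum())

# Assumption for the demo: blank out roughly 5% of the entries at random
rng = np.random.default_rng(0)
mask = rng.random(X.shape) < 0.05
X_missing = X.copy()
X_missing[mask] = np.nan

# Fill every hidden entry with the mean of the remaining values in its column
imputer = SimpleImputer(missing_values=np.nan, strategy='mean')
X_filled = imputer.fit_transform(X_missing)

print('Missing values before imputation:', np.isnan(X_missing).sum())
print('Missing values after imputation:', np.isnan(X_filled).sum())
```

Entries that were not hidden pass through unchanged; only the blanked cells receive the column mean.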

1.2.3) Example
import numpy as np

from sklearn.impute import SimpleImputer

data = [[1,2,np.nan],
[3,np.nan,1],
[5,np.nan,0],
[np.nan,4,6 ],
[5,0,np.nan],
[4,5,5]]

imp = SimpleImputer(missing_values=np.nan, strategy='mean')


imp = imp.fit(data)

modifieddata = imp.transform(data)
print(modifieddata)
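As a check on what the mean strategy computes, the sketch below compares the fill values stored by the fitted imputer (its statistics_ attribute) with column means computed directly by NumPy while ignoring nan. It reuses the data from the example above; the variable name manual_means is introduced only for this illustration.

```python
import numpy as np
from sklearn.impute import SimpleImputer

data = np.array([[1, 2, np.nan],
                 [3, np.nan, 1],
                 [5, np.nan, 0],
                 [np.nan, 4, 6],
                 [5, 0, np.nan],
                 [4, 5, 5]])

imp = SimpleImputer(missing_values=np.nan, strategy='mean').fit(data)

# Per-column fill values learned by fit()
print(imp.statistics_)

# The same means computed by hand, skipping nan entries
manual_means = np.nanmean(data, axis=0)
print(manual_means)
```

Both lines report 3.6, 2.75 and 3.0: for instance, the first column has five known values (1, 3, 5, 5, 4) that sum to 18, and 18 / 5 = 3.6.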

1.2.4) Example
import numpy as np

from sklearn.impute import SimpleImputer

data = [[1,2,np.nan],
[3,np.nan,1],
[5,np.nan,0],
[np.nan,4,6 ],
[5,0,np.nan],
[4,5,5]]

imp = SimpleImputer(missing_values=np.nan, strategy='median')


imp = imp.fit(data)

modifieddata = imp.transform(data)
print(modifieddata)
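The examples call fit and transform as two separate steps. One reason this split is useful is that the fill values learned from one dataset can be applied to other rows later. The sketch below illustrates this with the median strategy; new_rows is a hypothetical array invented for the demonstration.

```python
import numpy as np
from sklearn.impute import SimpleImputer

data = [[1, 2, np.nan],
        [3, np.nan, 1],
        [5, np.nan, 0],
        [np.nan, 4, 6],
        [5, 0, np.nan],
        [4, 5, 5]]

imp = SimpleImputer(missing_values=np.nan, strategy='median')
imp = imp.fit(data)

# Column medians learned from the original data
print(imp.statistics_)

# Hypothetical unseen rows: their gaps are filled with the medians learned above
new_rows = [[np.nan, np.nan, np.nan],
            [2, np.nan, 8]]
filled = imp.transform(new_rows)
print(filled)
```

The learned medians are 4, 3 and 3, so the first new row becomes [4, 3, 3] and the second becomes [2, 3, 8]. The new rows themselves have no influence on the fill values.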

1.2.5) Example
from sklearn.impute import SimpleImputer

data = [[1,2,0],
[3,0,1],
[5,0,0],
[0,4,6],
[5,0,0],
[4,5,5]]

# Here 0 (rather than nan) marks a missing entry, so zeros are excluded from the means
imp = SimpleImputer(missing_values=0, strategy='mean')


imp = imp.fit(data)

modifieddata = imp.transform(data)
print(modifieddata)
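Setting missing_values=0 treats every zero in every column as missing, which may be too broad when some zeros are genuine measurements. One possible workaround, sketched here under the assumption that only the second column uses 0 as a placeholder, is to convert those zeros to nan first and then impute on nan. The list zero_means_missing and the choice of column are hypothetical.

```python
import numpy as np
from sklearn.impute import SimpleImputer

data = np.array([[1, 2, 0],
                 [3, 0, 1],
                 [5, 0, 0],
                 [0, 4, 6],
                 [5, 0, 0],
                 [4, 5, 5]], dtype=float)

# Assumption: only in column 1 does a zero mean "missing"
zero_means_missing = [1]

marked = data.copy()
for col in zero_means_missing:
    # Turn placeholder zeros into nan in the selected column only
    marked[marked[:, col] == 0, col] = np.nan

imp = SimpleImputer(missing_values=np.nan, strategy='mean')
filled = imp.fit_transform(marked)
print(filled)
```

In this sketch the zeros in the first and third columns are kept as real values, while the three zeros in the second column are replaced by the mean of 2, 4 and 5.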
