Welcome to Scribd!

Machine Learning Practical-1 Aim:: S5Hyy5Pbnxpdhxnedozymvimzbizwiwzthjmdm3

Uploaded by

0% found this document useful (0 votes)

64 views4 pages

The document discusses preprocessing two datasets for machine learning. For the first dataset, it replaces missing numerical values with the mean and encodes categorical country values. For the second dataset, it one-hot encodes the outlook categorical variable. The Python code takes each dataset, preprocesses the data using techniques like imputation, label encoding, and one-hot encoding, and prints the transformed datasets.

Original Description:

Original Title

ML-Prac-1 Student

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

64 views4 pages

Machine Learning Practical-1 Aim:: S5Hyy5Pbnxpdhxnedozymvimzbizwiwzthjmdm3

Uploaded by

Devanshi Parejiya

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 4

Search inside document

1

Machine Learning
Practical-1
Aim:
1. Understanding of Data Pre-processing for given dataset 1 using Spyder (Python)
2. Replace Missing values by below imputation strategy.

If "mean", then replace missing values using the mean along the axis.
If "median", then replace missing values using the median along the axis.
If "most_frequent", then replace missing using the most frequent value along the
axis.

Ref.
https://docs.google.com/viewer?a=v&pid=sites&srcid=Z2FucGF0dW5pdmVyc2l0e
S5hYy5pbnxpdHxneDozYmViMzBiZWIwZThjMDM3

Predicated Output:

3. Understanding of categorical data.

4. Replace Country Attribute for given dataset 1 by fit_transform method.
Predicated Output:
2

5. Replace categorical and numerical attributes for given dataset 2 by OneHotEncoder Class.
Predicated Output:

Dataset 1:
Country Age Salary Purchased
France 44 72000 No
Spain 27 48000 Yes
Germany 30 54000 No
Spain 38 61000 No
Germany 40 NaN Yes
France 35 58000 Yes
Spain NaN 52000 No
France 48 79000 Yes
Germany 50 83000 No
France 37 67000 Yes

Dataset2:
Outlook Outlook0 Outlook1
Sunny 1 Sunny
Sunny 2 Sunny
Overcast 3 Overcast
Rain 4 Rain
Rain 5 Rain
Rain 2 Rain
Overcast 3 Overcast
Sunny 1 Sunny
Sunny 2 Sunny
Rain 3 Rain
Sunny 4 Sunny
Overcast 2 Overcast
Overcast 1 Overcast
Rain 3 Rain
3

Python Code: 1

# Data Preprocessing

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

# Importing the dataset

dataset = pd.read_csv('dataset1.csv')
X = dataset.iloc[:, :-1].values
y = dataset.iloc[:, 3].values

# Taking care of missing data

#from sklearn.preprocessing import Imputer

from sklearn.impute import SimpleImputer

imputer = SimpleImputer(missing_values = np.nan, strategy = 'mean')
imputer = imputer.fit(X[:, 1:3])
X[:, 1:3] = imputer.transform(X[:, 1:3])

from sklearn.preprocessing import LabelEncoder

labelencoder_X = LabelEncoder()
X[:, 0] = labelencoder_X.fit_transform(X[:, 0])
print(X)
4

Python Code: 2

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

# Importing the dataset

dataset = pd.read_csv('dataset2.csv')
X = dataset.iloc[:, :].values
#y = dataset.iloc[:, 3].values

# Encoding categorical data

from sklearn.preprocessing import LabelEncoder, OneHotEncoder
from sklearn.compose import ColumnTransformer

transform = ColumnTransformer([("Outlook_OL0_OL1",OneHotEncoder(),[0,2])],
remainder = 'passthrough')
X = transform.fit_transform(X)

transform = ColumnTransformer([("Outlook_OL0_OL1",OneHotEncoder(),[6])],
remainder = 'passthrough')
X = transform.fit_transform(X)
print(X.astype(int))

French Demystified, Premium 3rd Edition
From Everand
French Demystified, Premium 3rd Edition
Annie Heminway
Rating: 3.5 out of 5 stars
3.5/5 (3)
2019 Stat AP
Document96 pages
2019 Stat AP
Jinyu Lee
100% (1)
Hands-On Neural Networks
Document346 pages
Hands-On Neural Networks
naveen441
100% (2)
Mth603 AlotofSolved MCQSforFinalTerm Exam
Document14 pages
Mth603 AlotofSolved MCQSforFinalTerm Exam
Noreen Ahmed
33% (3)
Python Data Analytics: With Pandas, NumPy, and Matplotlib
From Everand
Python Data Analytics: With Pandas, NumPy, and Matplotlib
Fabio Nelli
Rating: 2 out of 5 stars
2/5 (1)
Summary of Jimmy Song's Programming Bitcoin
From Everand
Summary of Jimmy Song's Programming Bitcoin
IRB Media
No ratings yet
A Student's Guide to Python for Physical Modeling: Second Edition
From Everand
A Student's Guide to Python for Physical Modeling: Second Edition
Jesse M. Kinder
No ratings yet
Top Numerical Methods With Matlab For Beginners!
From Everand
Top Numerical Methods With Matlab For Beginners!
Andrei Besedin
No ratings yet
Learning C with Fractals
From Everand
Learning C with Fractals
Roger T. Stevens
No ratings yet
Practical Neural Network Recipies in C++
From Everand
Practical Neural Network Recipies in C++
Masters
Rating: 3.5 out of 5 stars
3.5/5 (5)
Ml-Prac 1: A V&Pid Sites&Srcid Z2Fucgf0Dw5Pdmvyc2L0Es5Hyy5Pbnxpdhxnedozymvimzbiz Wiwzthjmdm3
Document2 pages
Ml-Prac 1: A V&Pid Sites&Srcid Z2Fucgf0Dw5Pdmvyc2L0Es5Hyy5Pbnxpdhxnedozymvimzbiz Wiwzthjmdm3
Devanshi Parejiya
No ratings yet
(Lec 6) Decision Tree ML
Document26 pages
(Lec 6) Decision Tree ML
Muhtasim Jawad Nafi
No ratings yet
Lab 08: ID3 - Decision Tree and Linear Regression Objectives
Document4 pages
Lab 08: ID3 - Decision Tree and Linear Regression Objectives
zombiee hook
No ratings yet
Government Engineering College, Modasa: B.E. - Computer Engineering (Semester - VII) 3170724 - Machine Learning
Document3 pages
Government Engineering College, Modasa: B.E. - Computer Engineering (Semester - VII) 3170724 - Machine Learning
ronak
No ratings yet
Complexity of Algorithms Analysis, Stack, Queue, Tree and Binary Tree
Document39 pages
Complexity of Algorithms Analysis, Stack, Queue, Tree and Binary Tree
Alief
No ratings yet
(Slides) Module 11
Document103 pages
(Slides) Module 11
Gladwin Tirkey
No ratings yet
Dwdm-Lab Manual
Document39 pages
Dwdm-Lab Manual
sivavenkatkumar34
No ratings yet
KNN
Document5 pages
KNN
Uday Kumar
No ratings yet
Government Engineering College Modasa Semester:7 (C.E) Machine Learning (3170724)
Document27 pages
Government Engineering College Modasa Semester:7 (C.E) Machine Learning (3170724)
Bhargav Bharadiya
No ratings yet
Machine Learning
Document22 pages
Machine Learning
tina
No ratings yet
Matlab Tutorial5
Document50 pages
Matlab Tutorial5
Asterix
100% (8)
FORECASTING
Document37 pages
FORECASTING
Regina Jazzmim Quezada
No ratings yet
Bca-Machine Learning With R
Document8 pages
Bca-Machine Learning With R
ÃÑŠHÜ
No ratings yet
6 XG Boost - Jupyter Notebook
Document3 pages
6 XG Boost - Jupyter Notebook
venkatesh m
100% (1)
Lab1 Monte
Document15 pages
Lab1 Monte
姚逸凡
No ratings yet
OS Assignment III
Document4 pages
OS Assignment III
ritesh sinha
No ratings yet
Identification of Structures From Powder X-Ray Diffraction Data
$Identification of Structures From Powder X-Ray Diffraction Data$
Document4 pages
Identification of Structures From Powder X-Ray Diffraction Data
Carla Parra
No ratings yet
Imp Questions For Ci - Update
Document8 pages
Imp Questions For Ci - Update
rajeshwari64.y
No ratings yet
Lab Mannual
Document49 pages
Lab Mannual
vickyakfan152002
No ratings yet
Data Mining 2020
Document2 pages
Data Mining 2020
waleed
No ratings yet
Deep Learning Fundamentals Materials
Document216 pages
Deep Learning Fundamentals Materials
イタチ君イタチ君
No ratings yet
PAIFile2023 Final
Document48 pages
PAIFile2023 Final
Rohan 7
No ratings yet
Data Science Chapitre 1
Document54 pages
Data Science Chapitre 1
Leonel Ska
No ratings yet
Time Complexity Analysis
Document30 pages
Time Complexity Analysis
ADEDE Ezéchiel
No ratings yet
Algorithms and Its Comparisons
Document21 pages
Algorithms and Its Comparisons
Fizza Irfan
No ratings yet
UNIT-1 2 Mark Questions
Document11 pages
UNIT-1 2 Mark Questions
Dileep Yenuganti
No ratings yet
Ann MPDM Ii
Document42 pages
Ann MPDM Ii
sidra shafiq
No ratings yet
Example: Proton Treatment Plan With Subsequent Isocenter Shift
Document21 pages
Example: Proton Treatment Plan With Subsequent Isocenter Shift
Edis Đedović
No ratings yet
Accurate Multiple-Precision Gauss-Legendre Quadrature - Laurent Fousse - July 2007
Document9 pages
Accurate Multiple-Precision Gauss-Legendre Quadrature - Laurent Fousse - July 2007
RMolina65
No ratings yet
NN Lab2
Document5 pages
NN Lab2
Anne Wanningen
No ratings yet
PAI Practicle
Document16 pages
PAI Practicle
Rohan 7
No ratings yet
Gradient Descent
Document5 pages
Gradient Descent
Mark Schoolwork
No ratings yet
Lab 08 - Data Preprocessing
Document9 pages
Lab 08 - Data Preprocessing
rida
No ratings yet
Big Assignment 2
Document10 pages
Big Assignment 2
melesse bisema
No ratings yet
Stat 302 Practice Final: Brad Mcneney 2017-04-15
Document7 pages
Stat 302 Practice Final: Brad Mcneney 2017-04-15
Siiroostaiii Koomiaar
No ratings yet
Sclab
Document36 pages
Sclab
sarigasemmalai
No ratings yet
Foundations of Probability in Python - Part 3
Document55 pages
Foundations of Probability in Python - Part 3
Mohamed Gaber
No ratings yet
PAIFile 2023
Document48 pages
PAIFile 2023
Rohan 7
No ratings yet
Classification: Decision Trees
Document30 pages
Classification: Decision Trees
Ashish Tiwari
No ratings yet
Lecture 5: Algorithm Design and Time/space Complexity Analysis
Document54 pages
Lecture 5: Algorithm Design and Time/space Complexity Analysis
pranali suryawanshi
No ratings yet
SBF Group Assignment PDF
Document9 pages
SBF Group Assignment PDF
Ai Tien Tran
No ratings yet
Chapter#03 Supervised Learning and Its Algorithms - III
Document29 pages
Chapter#03 Supervised Learning and Its Algorithms - III
Muhammad Huzaifa
No ratings yet
Intro To Machine Learning With PyTorch
Document48 pages
Intro To Machine Learning With PyTorch
Ovidiu Toma
No ratings yet
Chapter VI
Document9 pages
Chapter VI
Tadesse Bitew
No ratings yet
MR - Bond Ka Aashirvad
Document29 pages
MR - Bond Ka Aashirvad
6109 Prashant Godhe
No ratings yet
Homework 3 Association Rule Mining
Document3 pages
Homework 3 Association Rule Mining
م. سهير عبد داؤد عسى
No ratings yet
Praktikum Modul 3
Document5 pages
Praktikum Modul 3
Juki Agus Riyanto
No ratings yet
Imperative Programming HW1
Document3 pages
Imperative Programming HW1
Beauty Market
No ratings yet
Portfolio Optimization
Document22 pages
Portfolio Optimization
भोला भण्डारी
No ratings yet
Hamming Code Trainer
Document38 pages
Hamming Code Trainer
Ashwani Kumar Yadav
No ratings yet
Previous Year Paper - Sem 7
Document12 pages
Previous Year Paper - Sem 7
tebade8363
No ratings yet
Statistical Analysis of Operational Risk Data
From Everand
Statistical Analysis of Operational Risk Data
Giovanni De Luca
No ratings yet
U.V. Patel College of Engineering Department of Computer Engineering and Information Technology Subject: Big Data Analytics (2IT709) LAB-1 Task 1
Document5 pages
U.V. Patel College of Engineering Department of Computer Engineering and Information Technology Subject: Big Data Analytics (2IT709) LAB-1 Task 1
Devanshi Parejiya
No ratings yet
Practical-1: 2IT702: Artificial Intelligence Practical-1
Document2 pages
Practical-1: 2IT702: Artificial Intelligence Practical-1
Devanshi Parejiya
No ratings yet
Ml-Prac 1: A V&Pid Sites&Srcid Z2Fucgf0Dw5Pdmvyc2L0Es5Hyy5Pbnxpdhxnedozymvimzbiz Wiwzthjmdm3
Document2 pages
Ml-Prac 1: A V&Pid Sites&Srcid Z2Fucgf0Dw5Pdmvyc2L0Es5Hyy5Pbnxpdhxnedozymvimzbiz Wiwzthjmdm3
Devanshi Parejiya
No ratings yet
ML-Prac 2 - AIM
Document2 pages
ML-Prac 2 - AIM
Devanshi Parejiya
No ratings yet
Ai Question Paper2
Document2 pages
Ai Question Paper2
kalyanram19858017
No ratings yet
Power & Energy Signals Questions and Answers - Sanfoundry
Document11 pages
Power & Energy Signals Questions and Answers - Sanfoundry
kshambelmekuye
No ratings yet
Problem On Flash Calculation
Document5 pages
Problem On Flash Calculation
Dixit Sabhani
No ratings yet
Applications of Group Theory in Cryptography and Coding Theory
Document11 pages
Applications of Group Theory in Cryptography and Coding Theory
Chandra Sekhar Akkapeddi
100% (1)
14.1 Bit Manipulation I - Apni Kaksha
Document5 pages
14.1 Bit Manipulation I - Apni Kaksha
Pankaj Kumar-76
No ratings yet
Four-State Trajectory-Tracking Control Law For Wheeled Mobile Robots
Document6 pages
Four-State Trajectory-Tracking Control Law For Wheeled Mobile Robots
wawee
No ratings yet
Pattern Classification: Second Edition
Document11 pages
Pattern Classification: Second Edition
jojo
No ratings yet
Fractional Order Derivative and Integral Using LabVIEW
Document13 pages
Fractional Order Derivative and Integral Using LabVIEW
auralius
No ratings yet
Simulated Annealing
Document11 pages
Simulated Annealing
gabby209
No ratings yet
An Integrated Survey of Project Scheduling
Document36 pages
An Integrated Survey of Project Scheduling
Rodrigo Giorgi
No ratings yet
Laws of Thermodynamics
Document5 pages
Laws of Thermodynamics
Robert Onsare
No ratings yet
Gauss Jordan - Algorithm and Matlab Program
Document3 pages
Gauss Jordan - Algorithm and Matlab Program
Mohit Singh
No ratings yet
Scaffolding A Math Problem: Solving For A Single Variable
Document14 pages
Scaffolding A Math Problem: Solving For A Single Variable
Nel Bornia
No ratings yet
HW6
Document2 pages
HW6
Thomas Rhee
0% (1)
Polynomial Functions: Notice An Error? Please Let Us Know!
Document2 pages
Polynomial Functions: Notice An Error? Please Let Us Know!
Eugene Sarmiento
No ratings yet
Decision Theory - Quiz
Document4 pages
Decision Theory - Quiz
shubhamgoelpgdmrm23
No ratings yet
Cross Docking
Document7 pages
Cross Docking
MariusCristian Luca
No ratings yet
Anupama
Document1 page
Anupama
Abdul Adil Ansari
No ratings yet
Signal & Systems Assignment 01 Questions
Document7 pages
Signal & Systems Assignment 01 Questions
pavan
No ratings yet
EE370 Lab Experiment 01
Document6 pages
EE370 Lab Experiment 01
Ayman Younis
No ratings yet
Understand Concept of Multi-Rate Signal Processing: (Autonomous College Affiliated To University of Mumbai)
Document2 pages
Understand Concept of Multi-Rate Signal Processing: (Autonomous College Affiliated To University of Mumbai)
nicO nee
No ratings yet
Alternating Split of A Given Singly Linked List
Document10 pages
Alternating Split of A Given Singly Linked List
akg299
No ratings yet
Experiment No: 01: Name of Experiment Objective
Document7 pages
Experiment No: 01: Name of Experiment Objective
Sanjid Elahi
No ratings yet
TC2 - Linear Equations
Document3 pages
TC2 - Linear Equations
Ammycynthiaachan aca
No ratings yet
Lesson 7 - Hardware and Software Theft, Vandalism and Failure
Document3 pages
Lesson 7 - Hardware and Software Theft, Vandalism and Failure
Jambres Delacruz
100% (1)
Option Delta With Skew Adjustment
Document33 pages
Option Delta With Skew Adjustment
Tze Shao
100% (1)
QBA (Class Test) Autumn 2020
Document1 page
QBA (Class Test) Autumn 2020
Nusrat Islam
No ratings yet
Mahendra College of Engineering: Lesson Plan
Document4 pages
Mahendra College of Engineering: Lesson Plan
Anonymous Ndsvh2so
No ratings yet