Professional Documents
Culture Documents
College of Engineering: Department of Information Technology SUBJECT: DS Using Python Lab
College of Engineering: Department of Information Technology SUBJECT: DS Using Python Lab
COLLEGE OF ENGINEERING
DEPARTMENT OF INFORMATION TECHNOLOGY
SUBJECT: DS using Python Lab
ACADEMIC YEAR: 2021-2022
COURSE CODE ITL605
PRACTICAL NO. 03
ROLL NO. 13
CLASS TE-IT
SEMESTER VI
GIVEN DATE 15/02/2022
CORRECTION DATE
REMARK
04 04 07 15
NAME& SIGN.
Prof Bhavna Dakhare
OF FACULTY
EXPERIMENT 03
AIM: To perform data Modelling on given dataset using python
THEORY:
As we work with datasets, a machine learning algorithm works in two stages. We usually
split the data around 20%-80% between testing and training stages. Under supervised
learning, we split a dataset into a training data and test data in Python ML.
a.
Prerequisites for Train and Test Data
(a). Partition the data set, for example 75% of the records are included in the training data
set and 25% are included in the test data set. Use a bar graph to confirm your
proportions
(b). Identify the total number of records in the training data set .
(c) Validate your partition by performing a two ‐sample Z ‐test.
Colab Link :
https://colab.research.google.com/drive/1Tjb5LroRPzTK54SCl2hectjM_jXL1qcz?
usp=sharing
Conclusion: Thus, we learned about the importance of splitting data into training
and testing sets as by splitting data into training and testing we can use it to check
the accuracy of our model. Furthermore, we imported a dataset into a pandas Data
frame and then used sklearn to split the data into training and testing sets