Professional Documents
Culture Documents
1. Aim of the practical: Understand supervised learning to train and develop classifier
model using PyCaret.
3. Basic Concept/ Command Description: In this experiment we have installed PyCaret and
imported a predefined dataset and trained and developed classifier model using PyCaret.
4. Code with screenshot of output:
Pycaret installed
'2.3.5'
2) Classification: Basics
dataset = get_data('index') #This will give index of all the available dataset in Pycare
Target
Data Default Target
Dataset Variable
Types Task Variable 1
2
Anomaly
0 anomaly Multivariate None None
Detection
Association
1 france Multivariate InvoiceNo Description
Rule Mining
Association
2 germany Multivariate InvoiceNo Description
Rule Mining
Classification
3 bank Multivariate deposit None
(Binary)
Classification
4 blood Multivariate Class None
(Binary)
Classification
5 cancer Multivariate Class None
(Binary)
Classification
6 credit Multivariate default None
(Binary)
Classification
7 diabetes Multivariate Class variable None
(Binary)
Classification
8 electrical_grid Multivariate stabf None
(Binary)
Classification
9 employee Multivariate left None
(Binary)
Classification
10 heart Multivariate DEATH None
(Binary)
Classification
11 heart_disease Multivariate Disease None
(Binary)
Classification
12 hepatitis Multivariate Class None
(Binary)
Classification
13 income Multivariate income >50K None
(Binary)
Classification
14 juice Multivariate Purchase None
(Binary)
Classification
15 nba Multivariate TARGET_5Yrs None
(Binary)
Classification
16 wine Multivariate type None
(Binary)
Classification
17 telescope Multivariate Class None
(Binary)
Classification
18 titanic Multivariate Survived None
(Binary)
Classification
19 us_presidential_election_results Multivariate party_winner None
(Binary)
Classification
20 glass Multivariate Type None
(Multiclass)
Classification
21 iris Multivariate species None
(Multiclass)
Classification
22 poker Multivariate CLASS None
(Multiclass)
Classification
23 questions Multivariate Next_Question None
(Multiclass)
Classification
24 satellite Multivariate Class None
(Multiclass)
Classification
25 CTG Multivariate NSP None
(Multiclass)
31
dataset_of_diabetes mice Multivariate
= get_data("diabetes") Clustering
#This will give theNone
dataset ofNone
'diabete
print(type(dataset_of_diabetes))
32 migration Multivariate Clustering None None
33 perfume Multivariate
Plasma Clustering None None
glucose
34 pokemon Multivariate Clustering
2-Hour BodyNone
mass None
concentration Diastolic Triceps
Number serum index Diabetes
35 of times a 2 hours in
population blood skin fold
Multivariate Clustering None
insulin (weight None
pedigree
pressure thickness in
an oral (ye
pregnant (mu kg/(height function
36 public_health(mm
glucose Hg)
Multivariate (mm)
Clustering None None
U/ml) in m)^2)
tolerance
37 testseeds Multivariate Clustering None None
38
0 6 wholesale
148 Multivariate
72 Clustering
35 0 None
33.6 None
0.627
39
1 1 tweets
85 66 Text 29 NLP 0 tweet
26.6 None
0.351
3.1) Using Random Forest Algo with confusion matrix For the performance
measurement of the classification.
data.columns
#Gives all the columns
448
5. Additional Creative Inputs (If Any):
Learning outcomes (What I have learnt):