Contoh Code Klastering Alur Hirarkial

Uploaded by

aditya Nugroho

0% found this document useful (0 votes)

2 views2 pages

Copyright

Available Formats

TXT, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as TXT, PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

2 views2 pages

Contoh Code Klastering Alur Hirarkial

Uploaded by

aditya Nugroho

Copyright:

Available Formats

Download as TXT, PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 2

Search inside document

#Alur Hirarkial/////////////////////////////////////////////////////////////

#delete var1
del df['Var_1']

#IMPORT LIBRARY
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.cluster import AgglomerativeClustering
from scipy.cluster.hierarchy import dendrogram, linkage
from sklearn.metrics import silhouette_score
from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import PCA
from sklearn.metrics.pairwise import cosine_similarity

#melihat jumlah NaN

df.isna().sum()

#PENGECEKAN VALUE DARI COLUMN

df['Pengalaman Kerja'].value_counts()

#Replace paling banyak

df['Pernah_Menikah'] = df['Pernah_Menikah'].fillna("Ya")
df['Lulusan Pendidikan'] = df['Lulusan Pendidikan'].fillna("Ya")
df['Pekerjaan'] = df['Pekerjaan'].fillna("Artis")
df['Pengalaman Kerja'] = df['Pengalaman Kerja'].fillna(1.0)
df['Jumlah Keluarga'] = df['Jumlah Keluarga'].fillna(2.0)

#mengubah karakrter menjadi nilai

from sklearn.preprocessing import OrdinalEncoder
ord_enc = OrdinalEncoder()
df["Usia"] = ord_enc.fit_transform(df[["Usia"]])
df["Pekerjaan"] = ord_enc.fit_transform(df[["Pekerjaan"]])
df["Jenis Kelamin"] = ord_enc.fit_transform(df[["Jenis Kelamin"]])
df["Pernah_Menikah"] = ord_enc.fit_transform(df[["Pernah_Menikah"]])
df["Lulusan Pendidikan"] = ord_enc.fit_transform(df[["Lulusan Pendidikan"]])
df["Besar Pengeluaran"] = ord_enc.fit_transform(df[["Besar Pengeluaran"]])
df.head(5)

#
scaler = StandardScaler()
scaled_data = scaler.fit_transform(df)
df = pd.DataFrame(scaled_data, columns= df.columns)
df

#Grafik
plt.figure(figsize=(15,10))
dendrogram(linkage(df, method="ward"), leaf_rotation=90, p=5, color_threshold=20,
leaf_font_size=10, truncate_mode='level')
plt.show()

#MENCARI JUMLAH CLUSTER YANG PALING OPTIMAL DENGAN SILHOUETE SCORE

from sklearn.metrics import silhouette_score
silhouette_scores = []
for n_cluster in range(2,10):
index = n_cluster-2
silhouette_scores.append(
silhouette_score(df, AgglomerativeClustering(n_clusters =
n_cluster).fit_predict(df)))
print("silhouette_score for n_cluster = ",n_cluster," is
",silhouette_scores[index])
plt.bar(range(2,10), silhouette_scores)
plt.xlabel("number of cluster",fontsize=10)
plt.ylabel("Silhouette score",fontsize=10)
plt.show()

#
agglo = AgglomerativeClustering(n_clusters = 3)
agglo.fit(df)
labels = agglo.labels_
df = pd.concat([df, pd.DataFrame({'cluster' : labels})], axis=1)
df.head(5)

#
for i in df :
grid = sns.FacetGrid(df, col='cluster')
grid.map(plt.hist, i)

#
#DECOMPOSISI PCA
dist = 1 - cosine_similarity(df)
pca = PCA(n_components = 2)
pca = pca.fit_transform(dist)

#
#VISUALISASI
x, y = pca[:,0], pca[:,1]
warna = {
0 : 'red',
1 : 'green',
2 : 'yellow'
}
label_pca = {
0 : 'cluster 1',
1 : 'cluster 2',
2 : 'cluster 3'
}
df = pd.DataFrame({'x' : x, 'y' : y, 'label' : labels})
groups = df.groupby('label')
fig, ax = plt.subplots(figsize=(15,10))
for name, group in groups :
ax.plot(group.x, group.y, marker='o', linestyle='', ms=5,
color=warna[name], label = label_pca[name], mec='none')
ax.set_aspect('auto')
ax.tick_params(axis='x', which = 'both', bottom = 'off', top = 'off',
labelbottom = 'off')
ax.tick_params(axis='y', which = 'both', bottom = 'off', top = 'off',
labelbottom = 'off')
ax.legend()
ax.set_title("Visualisasi Agglomerative Clustering")
plt.show()

Delhivery Mani
Document79 pages
Delhivery Mani
kishore kumar
No ratings yet
AnalytixLabs - Data Science Using Python9
Document15 pages
AnalytixLabs - Data Science Using Python9
pundirsandeep
No ratings yet
Machine Learning Notes: 2. All The Commands For Eda
Document5 pages
Machine Learning Notes: 2. All The Commands For Eda
naveen katta
100% (1)
Computer Engineering Laboratory Solution Primer
From Everand
Computer Engineering Laboratory Solution Primer
Karan Bhandari
No ratings yet
RFM Model For Customer Purchase Behaviour Using K-Means Algorithm
Document55 pages
RFM Model For Customer Purchase Behaviour Using K-Means Algorithm
Shubhankar
No ratings yet
Practicle6 (Code)
Document4 pages
Practicle6 (Code)
Pallavi Gaikwad
No ratings yet
23MCA1104 - EX10 - KMEANS - Ipynb - Colab
Document1 page
23MCA1104 - EX10 - KMEANS - Ipynb - Colab
Piyush Verma
No ratings yet
Aiml Lab Manual 2023
Document17 pages
Aiml Lab Manual 2023
shamilie17
No ratings yet
23MCA1104 - Exercise - 10 - Hierarchical Clustering - Ipynb - Colab
Document2 pages
23MCA1104 - Exercise - 10 - Hierarchical Clustering - Ipynb - Colab
Piyush Verma
No ratings yet
Data Science Libraries
Document4 pages
Data Science Libraries
ayushyadav73095
No ratings yet
Big Data Merged
Document7 pages
Big Data Merged
Ingame Id
No ratings yet
20MIS1025 - DecisionTree - Ipynb - Colaboratory
Document4 pages
20MIS1025 - DecisionTree - Ipynb - Colaboratory
Sandip Das
No ratings yet
ML
Document7 pages
ML
21eg105f37
No ratings yet
Untitled2.ipynb - Colaboratory
Document2 pages
Untitled2.ipynb - Colaboratory
rvvnbrao
No ratings yet
EE 559 HW2Code PDF
Document7 pages
EE 559 HW2Code PDF
Ali
No ratings yet
Kmeans Clustering Implementation Using Python
Document5 pages
Kmeans Clustering Implementation Using Python
Poornima Ghodke
No ratings yet
16BCB0126 VL2018195002535 Pe003
Document40 pages
16BCB0126 VL2018195002535 Pe003
Mohit
No ratings yet
Task:-5: Name:-Shambel Gonfa Reg no:-18BCE2429 Data Vitualization Lab Course code:-CSE3020
Document8 pages
Task:-5: Name:-Shambel Gonfa Reg no:-18BCE2429 Data Vitualization Lab Course code:-CSE3020
Iyyaasuu Yaadataa
No ratings yet
C121 Exp1
Document32 pages
C121 Exp1
Devanshu Maheshwari
No ratings yet
ML Lab Record
Document15 pages
ML Lab Record
rr3870044
No ratings yet
Laboratoare SBC
Document17 pages
Laboratoare SBC
Denisa Alina
No ratings yet
Week 8. K-Means
Document7 pages
Week 8. K-Means
revaldianggara
No ratings yet
Kmeans
Document7 pages
Kmeans
patil samrudhi
No ratings yet
Sample 1
Document3 pages
Sample 1
m03479368
No ratings yet
Code
Document6 pages
Code
Keerti Gulati
No ratings yet
SVM K NN MLP With Sklearn Jupyter NoteBo
Document22 pages
SVM K NN MLP With Sklearn Jupyter NoteBo
Ahm Tharwat
No ratings yet
Unit1 ML Programs
Document5 pages
Unit1 ML Programs
diroja5648
No ratings yet
CODIGO#
Document4 pages
CODIGO#
deger treuri
No ratings yet
Support Vector Machine
Document3 pages
Support Vector Machine
VIJAY YADAV
No ratings yet
Ai ML Programs
Document34 pages
Ai ML Programs
Yasar Bilal
No ratings yet
22MCA1008 - Varun ML LAB ASSIGNMENTS
Document41 pages
22MCA1008 - Varun ML LAB ASSIGNMENTS
S Varun (RA1931241020133)
100% (1)
Linear Regression Implementation Model For House-Price Prediction System
Document3 pages
Linear Regression Implementation Model For House-Price Prediction System
Anurag Pandey
No ratings yet
ML 1-10
Document53 pages
ML 1-10
22128008
No ratings yet
Nadya Faudilla - 1806198471 - Geologi Komputasi 5 Dan 6 - Jupyter Notebook
Document9 pages
Nadya Faudilla - 1806198471 - Geologi Komputasi 5 Dan 6 - Jupyter Notebook
Emir Rakhim
No ratings yet
Cod SBC
Document16 pages
Cod SBC
Denisa Alina
No ratings yet
EDA Plots Code
Document13 pages
EDA Plots Code
prashant yadav
No ratings yet
(Big Data Analytics With PySpark) (CheatSheet)
Document7 pages
(Big Data Analytics With PySpark) (CheatSheet)
Niwahereza Dan
No ratings yet
3a Data Frame - Jupyter Notebook
Document5 pages
3a Data Frame - Jupyter Notebook
venkatesh m
No ratings yet
QDA in Python
Document3 pages
QDA in Python
Jaydev Raval
No ratings yet
Asep Purnama - 140710180027 - Praktik LSTM
Document9 pages
Asep Purnama - 140710180027 - Praktik LSTM
Asep
No ratings yet
7 - 201904121342. Lampiran Skripsi
Document65 pages
7 - 201904121342. Lampiran Skripsi
ilfisyafa
No ratings yet
1 Kmeans-Pratical-No-1
Document8 pages
1 Kmeans-Pratical-No-1
Akanksha Supare
No ratings yet
FA Notes
Document2 pages
FA Notes
Shreya Garg
No ratings yet
Grid Search For SVM
Document9 pages
Grid Search For SVM
kPrasad8
No ratings yet
0 Aimlfinal
Document24 pages
0 Aimlfinal
arvindhrk05
No ratings yet
Import As Import As Import As: 'Social - Network - Ads - CSV'
Document2 pages
Import As Import As Import As: 'Social - Network - Ads - CSV'
lucansitumeang21
No ratings yet
Texte Classification
Document9 pages
Texte Classification
Ala Arboun
No ratings yet
Python CA 4
Document9 pages
Python CA 4
subham patra
No ratings yet
Python Slips
Document9 pages
Python Slips
emmamusk061
No ratings yet
Appendix PDF
Document5 pages
Appendix PDF
Rama
No ratings yet
Cardio Screen RF
Document27 pages
Cardio Screen RF
The Mind
100% (1)
Mercedes-Benz Greener Manufacturing Ai
Document16 pages
Mercedes-Benz Greener Manufacturing Ai
Puji
0% (1)
PGM 7
Document3 pages
PGM 7
badeni
No ratings yet
Import
Document15 pages
Import
Satyam Yadav
No ratings yet
DAA Record
Document15 pages
DAA Record
dharun0704
No ratings yet
CV Assignment 2 Group02
Document12 pages
CV Assignment 2 Group02
Manash Barman
No ratings yet
Principal Component Analysis Notes : Info
Document22 pages
Principal Component Analysis Notes : Info
VALMICK GUHA
No ratings yet
Python Note 3
Document11 pages
Python Note 3
Coding Knowledge
No ratings yet
Tutorial Classification Py
Document7 pages
Tutorial Classification Py
Lucas Z
No ratings yet
Artificial Intelligence (18Csc305J) Lab: EXPERIMENT 13: Implementation of NLP Problem
Document9 pages
Artificial Intelligence (18Csc305J) Lab: EXPERIMENT 13: Implementation of NLP Problem
SAILASHREE PANDAB (RA1911032010030)
No ratings yet
Assignment 10 2
Document4 pages
Assignment 10 2
dash
No ratings yet
P03 A Star Algorithm 35 Anushka Shetty
Document23 pages
P03 A Star Algorithm 35 Anushka Shetty
anohanabrotherhoodcave
No ratings yet
CMPT 641-Digital Transformation Plan - Phase 1 - Part 2
Document18 pages
CMPT 641-Digital Transformation Plan - Phase 1 - Part 2
raghav
No ratings yet
JaydwinLabiano - WhatData MIne
Document9 pages
JaydwinLabiano - WhatData MIne
Jaydwin Labiano
No ratings yet
Chapter 5: Selecting A Sample: Objectives
Document48 pages
Chapter 5: Selecting A Sample: Objectives
nadi_asha
No ratings yet
Distances: Business Analytics - The Science of Data Driven Decision Making
Document33 pages
Distances: Business Analytics - The Science of Data Driven Decision Making
Asmita Nagpal
No ratings yet
DWDM R19 Unit 1
Document27 pages
DWDM R19 Unit 1
GAYATHRI KAMMARA 19MIS7006
No ratings yet
Encog 3 - 3 Devguide PDF
Document47 pages
Encog 3 - 3 Devguide PDF
Abian
No ratings yet
Retail Customer Segmentation Using SAS
Document19 pages
Retail Customer Segmentation Using SAS
priya
No ratings yet
Authentication in MANETs Yang Sadasivam Final
Document7 pages
Authentication in MANETs Yang Sadasivam Final
VijayKumar Lokanadam
No ratings yet
NetLogo K-Means Guidelines
Document3 pages
NetLogo K-Means Guidelines
vignesh
No ratings yet
20EC067 AI Assignment
Document32 pages
20EC067 AI Assignment
20EC081 Satyam Shukla
No ratings yet
USL - Problem Statement
Document3 pages
USL - Problem Statement
Market Charcha
No ratings yet
Wongetal - HKIE2019
Document13 pages
Wongetal - HKIE2019
Afrizal Syahbana
No ratings yet
Web Server Log Analysis Sysytem
Document3 pages
Web Server Log Analysis Sysytem
Nexgen Technology
No ratings yet
Unsupervised Machine Learning - What Is, Algorithms, Example
Document11 pages
Unsupervised Machine Learning - What Is, Algorithms, Example
Cristina
No ratings yet
Cse-Ai 2-2 Sem Cs&Syllabus Ug r20
Document19 pages
Cse-Ai 2-2 Sem Cs&Syllabus Ug r20
Aditya Kumar Tikkireddi
No ratings yet
Thesis Book 2
Document57 pages
Thesis Book 2
w6jkzfc7pg
No ratings yet
Data Mining:: Dr. Hany Saleeb
Document37 pages
Data Mining:: Dr. Hany Saleeb
Deepthi Prasad
No ratings yet
Monitoring The Network Monitoring System: Anomaly Detection Using Pattern Recognition
Document4 pages
Monitoring The Network Monitoring System: Anomaly Detection Using Pattern Recognition
mary isaak
No ratings yet
Postgraduate Degree Course
Document11 pages
Postgraduate Degree Course
Sumaiya Majeed
No ratings yet
Clna17669enc 001 PDF
Document372 pages
Clna17669enc 001 PDF
sdiaman
No ratings yet
What Is MIME?: MIME As An Internet Protocol
Document8 pages
What Is MIME?: MIME As An Internet Protocol
myprofile0225
No ratings yet
An Integration of K-Means and Decision Tree (ID3) Towards A More Efficient Data Mining Algorithm
Document7 pages
An Integration of K-Means and Decision Tree (ID3) Towards A More Efficient Data Mining Algorithm
Journal of Computing
No ratings yet
Ece
Document6 pages
Ece
Ahmad Khanif Fikri
No ratings yet
Industrial Engineering Thesis Sample
Document6 pages
Industrial Engineering Thesis Sample
bseb81xq
100% (2)
SK Learn
Document9 pages
SK Learn
dome
No ratings yet
KEEL: A Data Mining Software Tool Integrating Genetic Fuzzy Systems
Document7 pages
KEEL: A Data Mining Software Tool Integrating Genetic Fuzzy Systems
zoombados
No ratings yet
Unit 3 Supervised Learning
Document89 pages
Unit 3 Supervised Learning
narmathaapcse
No ratings yet
Introduction To Computation and Programming Using Python Third Edition John V Guttag Full Chapter
Document51 pages
Introduction To Computation and Programming Using Python Third Edition John V Guttag Full Chapter
agnes.baer637
100% (4)