Professional Documents
Culture Documents
Designing Clusters
Designing Clusters
How this is going to work...
We have four scenarios to choose from
● A Data Scientist training the first iteration of a model
● A SQL Analyst developing a report to be ran once a month
● A Team of 10 Data Analyst executing ad-hoc queries
● A Data Engineer processing a weekly job to ingest customer records
● The cluster setup including Cluster Category, VM Level and Compute Level
Designing Clusters
Cluster Categories & VM Levels
Memory Optimized Compute Optimized
Type Memory Cores $/Hour Type Memory Cores $/Hour
M-1 32 GB 4 $0.252 C-1 16 GB 8 $0.340
M-2 64 GB 8 $0.504 C-2 32 GB 16 $0.680
M-3 128 GB 16 $1.008 C-3 64 GB 32 $1.360
M-4 256 GB 32 $2.016 C-4 128 GB 64 $2.720