Professional Documents
Culture Documents
Victor Yap - One Cluster To Rule Them All
ML on the Cloud
Victor Yap
MLOps Engineer
Rev.com / Rev.ai
Speech Recognition for Customers
Autoscaling
Rightsizing
Spot Instances
Environment Management
Ray
Kubernetes
Karpenter
Ray - “a simple, universal API
for building distributed applications”
AWS Batch?
Slurm?
Ray is Python-Friendly
ray.init() # starts a local cluster in one line
@ray.remote(num_cpus=1, memory=1024 ** 3)
def preprocess(data):
pass # do stuff
@ray.remote(num_gpus=1, accelerator_type="p2")
def train():
pass # do stuff
Kubernetes
Karpenter
Demo
import ray
@contextmanager
def timer():
    """Context manager that measures and prints the wall-clock running
    time of its body, rounded to whole seconds.

    Fix: the original printed after a bare ``yield``, so nothing was
    reported when the timed block raised. The ``try``/``finally`` here
    guarantees the elapsed time is printed even on exceptions.
    """
    start = time.time()
    try:
        yield
    finally:
        time_elapsed = round(time.time() - start)
        print(f"timer: took {time_elapsed} seconds")
def get_instance_type():
    """Return the EC2 instance type (e.g. ``"t3.2xlarge"``) this code is
    running on, read from the instance metadata service (IMDSv2).

    IMDSv2 is a two-step protocol: PUT to fetch a short-lived session
    token, then GET the metadata path presenting that token.

    Fix: added ``timeout=`` to both requests — without it, calling this
    anywhere the link-local metadata endpoint is unreachable hangs
    forever instead of failing fast with ``requests.ConnectTimeout``.
    """
    import requests

    token = requests.put(
        "http://169.254.169.254/latest/api/token",
        headers={"X-aws-ec2-metadata-token-ttl-seconds": "21600"},
        timeout=2,
    ).text
    instance_type = requests.get(
        "http://169.254.169.254/latest/meta-data/instance-type",
        headers={"X-aws-ec2-metadata-token": token},
        timeout=2,
    ).text
    return instance_type
def print_cluster_resources():
    """Print the total CPUs, memory (in GB) and GPUs that the currently
    connected Ray cluster reports via ``ray.cluster_resources()``."""
    resources = ray.cluster_resources()
    cpu_total = int(resources["CPU"])
    # Ray reports memory in bytes; convert to (decimal) gigabytes.
    mem_gb = round(resources["memory"] / (1000 ** 3))
    # "GPU" is absent on CPU-only clusters, so default to 0.
    gpu_total = round(resources.get("GPU", 0))
    print(f"CPUs = {cpu_total}, memory = {mem_gb}G, GPUs = {gpu_total}")
@ray.remote(num_cpus=1, memory=1000 ** 3)
def preprocess(data):
    """Ray task reserving 1 CPU and ~1 GB of memory.

    Simulates a second of preprocessing work, then returns the EC2
    instance type of whichever node Ray scheduled it onto.
    """
    time.sleep(1)
    return get_instance_type()
with timer():
print(ray.get(preprocess.remote("data")))
t3.2xlarge
timer: took 2 seconds
print_cluster_resources()
print_cluster_resources()
print(ray.get(preprocess_big_data.remote()))
print_cluster_resources()
i4i.8xlarge
CPUs = 34, memory = 197G, GPUs = 0
@ray.remote(num_gpus=4, accelerator_type="p2")
def train():
    """Ray task requiring 4 GPUs on a p2-class accelerator node.

    Returns the EC2 instance type of the node the task landed on, so
    the demo can show which hardware the autoscaler provisioned.
    """
    return get_instance_type()
with timer():
print(ray.get(train.remote()))
print_cluster_resources()
p2.xlarge
timer: took 178 seconds
CPUs = 37, memory = 239G, GPUs = 1
@ray.remote(num_gpus=4, accelerator_type="p2")
def train():
    """Ray task requiring 4 GPUs on a p2-class accelerator node; reports
    the EC2 instance type it was scheduled onto. (Repeated slide — same
    definition as earlier, re-run to show the already-scaled cluster.)
    """
    return get_instance_type()
with timer():
print(ray.get(train.remote()))
print_cluster_resources()
Or
github.com/vicyap/mlops-world-2022