
The New Stack and Ops for AI

https://www.youtube.com/watch?v=XGJNo8TpuVA
Going from prototype to production

A framework to help guide you in moving your app from prototype to production.

1. User Experience - Challenges
Control for uncertainty
Build guardrails for steerability and safety
Build a transparent UX
Keep the human in the loop

Communicate the system’s capabilities and limitations to the user


Guide the user through Human/AI collaboration

Guardrails = Safety controls for LLMs


Guardrails are essential for UX, especially for applications in regulated industries.
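One simple way to implement an input guardrail (not necessarily the exact approach shown in the talk) is to screen user input with OpenAI's Moderation API before it ever reaches the chat model. A minimal sketch, assuming the official openai Python SDK (v1.x) and an OPENAI_API_KEY in the environment; the system prompt and refusal message are illustrative:

from openai import OpenAI

client = OpenAI()

def answer_with_guardrail(user_input: str) -> str:
    # Safety guardrail: block flagged input instead of forwarding it to the model.
    moderation = client.moderations.create(input=user_input)
    if moderation.results[0].flagged:
        return "Sorry, I can't help with that request."

    # Steerability guardrail: a constrained system prompt keeps answers on-topic.
    response = client.chat.completions.create(
        model="gpt-4",
        messages=[
            {"role": "system", "content": "You are a support assistant. "
             "Only answer questions about our product; otherwise politely decline."},
            {"role": "user", "content": user_input},
        ],
    )
    return response.choices[0].message.content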
2. Model Consistency
Constrain model behavior
JSON mode => allows you to force the model to output JSON; new parameter json_schema …
Reproducible output: you can get significantly more reproducible output using the seed parameter
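A minimal sketch of both controls, assuming the openai Python SDK (v1.x) and a model that supports JSON mode and seed (gpt-4-1106-preview here); the prompt and field names are illustrative:

from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
    model="gpt-4-1106-preview",
    seed=42,  # same seed + same parameters => significantly more reproducible output
    response_format={"type": "json_object"},  # JSON mode: force valid JSON output
    messages=[
        {"role": "system", "content": "Extract the person's name and city as JSON "
         "with keys 'name' and 'city'."},
        {"role": "user", "content": "Alice moved to Paris last year."},
    ],
)

print(response.choices[0].message.content)  # e.g. {"name": "Alice", "city": "Paris"}
# system_fingerprint identifies the backend configuration; compare it across
# runs when checking how reproducible your outputs actually are.
print(response.system_fingerprint)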

Ground the model (using a knowledge store)


In the input context, explicitly give the model “grounded facts” to reduce the likelihood of hallucinations (as in RAG)
Example (flow from the slide): function call API => call an API function against a grounded fact source => structured answer.

Grounded fact sources: search index, retrieval, database, etc.
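A minimal sketch of this grounding pattern, assuming the openai Python SDK; the tiny in-memory fact list and keyword matching stand in for a real search index, retrieval system, or database:

from openai import OpenAI

client = OpenAI()

def retrieve_facts(query: str) -> list[str]:
    # Stand-in for a real grounded fact source (search index, retrieval, database).
    knowledge_store = [
        "Our premium plan costs $20/month.",
        "Refunds are available within 30 days of purchase.",
    ]
    words = query.lower().split()
    return [fact for fact in knowledge_store
            if any(word in fact.lower() for word in words)]

def grounded_answer(question: str) -> str:
    facts = retrieve_facts(question)
    context = "\n".join(f"- {fact}" for fact in facts)
    response = client.chat.completions.create(
        model="gpt-4",
        messages=[
            # Grounded facts go explicitly into the input context.
            {"role": "system", "content":
                "Answer using ONLY the facts below. If they do not contain "
                "the answer, say you don't know.\n\nFacts:\n" + context},
            {"role": "user", "content": question},
        ],
    )
    return response.choices[0].message.content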

3. Evaluating Performance - Strategies to evaluate


1. Create eval suites for your specific use cases: https://github.com/openai/evals
Log and track your eval runs
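A minimal sketch of a use-case-specific eval suite that logs each run, assuming the openai Python SDK; the test cases, substring scoring, and eval_runs.jsonl log file are illustrative (openai/evals provides a much fuller framework):

import json
import time
from openai import OpenAI

client = OpenAI()

EVAL_CASES = [
    {"input": "What is 2 + 2?", "expected": "4"},
    {"input": "What is the capital of France?", "expected": "Paris"},
]

def run_eval(model: str = "gpt-3.5-turbo") -> dict:
    results = []
    for case in EVAL_CASES:
        answer = client.chat.completions.create(
            model=model,
            messages=[{"role": "user", "content": case["input"]}],
        ).choices[0].message.content
        # Simple substring check; real suites use stricter, task-specific scoring.
        results.append({**case, "answer": answer,
                        "passed": case["expected"].lower() in answer.lower()})
    run = {
        "model": model,
        "timestamp": time.time(),
        "accuracy": sum(r["passed"] for r in results) / len(results),
        "results": results,
    }
    # Log and track: append every run so results can be compared over time.
    with open("eval_runs.jsonl", "a") as f:
        f.write(json.dumps(run) + "\n")
    return run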

2. Model-graded evals (using AI to grade AI)


GPT-4 is a strong evaluator; use binary metrics
Use a metric closely correlated with what your users would expect
If GPT-4 is too expensive/slow for evals, you can fine-tune a GPT-3.5 “judge” by distilling GPT-4’s output
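A minimal sketch of a model-graded eval with a binary PASS/FAIL metric, assuming the openai Python SDK; the judge prompt wording is an illustrative assumption:

from openai import OpenAI

client = OpenAI()

JUDGE_PROMPT = """You are grading an AI assistant's answer.
Question: {question}
Reference answer: {reference}
Assistant answer: {answer}
Reply with exactly one word: PASS if the assistant answer is correct and
consistent with the reference, FAIL otherwise."""

def judge(question: str, reference: str, answer: str) -> bool:
    # GPT-4 as the grader; if it is too slow/expensive, distill its verdicts
    # into a fine-tuned GPT-3.5 judge and swap the model name here.
    verdict = client.chat.completions.create(
        model="gpt-4",
        messages=[{"role": "user", "content": JUDGE_PROMPT.format(
            question=question, reference=reference, answer=answer)}],
    ).choices[0].message.content.strip().upper()
    return verdict.startswith("PASS")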
4. Managing Latency & Cost (orchestration)
Two strategies for managing cost and latency (a sketch of both follows below):
1. Use semantic caching (reduce the number of round trips you’re making)
2. Route to cheaper models
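A minimal sketch of both strategies combined, assuming the openai Python SDK and numpy; the similarity threshold, embedding model, and the "short query => cheaper model" routing heuristic are illustrative assumptions:

import numpy as np
from openai import OpenAI

client = OpenAI()
_cache: list[tuple[np.ndarray, str]] = []  # (query embedding, cached answer)

def _embed(text: str) -> np.ndarray:
    result = client.embeddings.create(model="text-embedding-ada-002", input=text)
    return np.array(result.data[0].embedding)

def answer(query: str, similarity_threshold: float = 0.95) -> str:
    q = _embed(query)
    # 1. Semantic cache: reuse a stored answer if a past query is similar enough,
    #    avoiding an extra round trip to the chat model.
    for cached_q, cached_answer in _cache:
        sim = float(q @ cached_q / (np.linalg.norm(q) * np.linalg.norm(cached_q)))
        if sim >= similarity_threshold:
            return cached_answer
    # 2. Routing: send short/simple queries to a cheaper model.
    model = "gpt-3.5-turbo" if len(query.split()) < 20 else "gpt-4"
    result = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": query}],
    ).choices[0].message.content
    _cache.append((q, result))
    return result

In practice you would tune the similarity threshold against real traffic: too low and users get mismatched cached answers, too high and the cache rarely hits.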


LLMOps
