Professional Documents
Culture Documents
In Production
Lương Anh Tuấn: 2022-11-24
https://www.datarobot.com/wiki/machine-learning-life-cycle/
https://www.tecton.ai/blog/what-is-a-feature-store/
copyright© Asilla, Inc.
Feature Store Example
https://www.tecton.ai/
https://ml-ops.org/content/mlops-principles#experiments-tracking
https://github.com/EthicalML/awesome-prod
uction-machine-learning
https://github.com/lfai/lfai-landscape
MLOps
Principles Data ML Model Code
Versioning 1) Data preparation pipelines 1) ML model training pipeline 1) Application code
2) Features store 2) ML model (object) 2) Configurations
3) Datasets 3) Hyperparameters
4) Metadata 4) Experiment tracking
Testing 1) Data Validation (error detection) 1) Model specification is unit tested 1) Unit testing
2) Feature creation unit testing 2) ML model training pipeline is integration tested 2) Integration testing for the end-to-end
3) ML model is validated before being operationalized pipeline
4) ML model staleness test (in production)
5) Testing ML model relevance and correctness
6) Testing non-functional requirements (security, fairness, interpretability)
Automation 1) Data transformation 1) Data engineering pipeline 1) ML model deployment with CI/CD
2) Feature creation and manipulation 2) ML model training pipeline 2) Application build
3) Hyperparameter/Parameter selection
Reproducibility 1) Backup data 1) Hyperparameter tuning is identical between dev and prod 1) Versions of all dependencies in dev
2) Data versioning 2) The order of features is the same and prod are identical
3) Extract metadata 3) Ensemble learning: the combination of ML models is same 2) Same technical stack for dev and
4) Versioning of feature engineering 4)The model pseudo-code is documented production environments
3) Reproducing results by providing
container images or virtual machines
Deployment 1) Feature store is used in dev and prod 1) Containerization of the ML stack 1) On-premise, cloud, or edge
environments 2) REST API
3) On-premise, cloud, or edge
Monitoring 1) Data distribution changes (training vs. serving 1) ML model decay 1) Predictive quality of the application on
data) 2) Numerical stability serving data
2) Training vs serving features 3) Computational performance of the ML model
https://ml-ops.org/content/mlops-principles#summary-of-mlops-principles-and-best-practices
copyright© Asilla, Inc.
Best practices to MLOps principles
https://ml-ops.org/content/mlops-principles#summary-of-mlops-principles-and-best-practices
https://github.com/alexeygrigorev/mlbookcamp-code https://mlops-guide.github.io/Workflow/
https://github.com/paiml/practical-mlops-book https://ml-ops.org/
https://mlops.community/blog/ https://stanford-cs329s.github.io/syllabus.html
https://github.com/PacktPublishing/The-Machine-Learning https://github.com/DataTalksClub/mlops-zoomcamp
-Solutions-Architect-Handbook
https://github.com/kennethleungty/MLOps-Specialization-Notes
https://drive.google.com/drive/folders/1GpaAv1KhNNtYg3EsI6RpufFj8Tylv0UQ?usp=share_link
https://www.run.ai/guides/machine-learning-operations