Skip to content

Latest commit

 

History

History
137 lines (111 loc) · 10.4 KB

mlops.md

File metadata and controls

137 lines (111 loc) · 10.4 KB

This page covers MLOps and DataOps/Eng topics. I started this in early 2020. Also see dataeng

General Articles, Blogs, and Talks

Principles & Meta

Practices

Maturity Modesl

Relevant Research & Serious Articles

Conferences & Workshops

Tooling

Dagster

Metaflow (Netflix)

Mlfow (DataBricks)

  • Simplifying Model Management with MLflow
  • mlfow - an open source platform for managing the end-to-end machine learning lifecycle. It tackles three primary functions: 1) tracking experiments to record and compare parameters and results 2) packaging ML code in a reusable, reproducible form in order to share with other data scientists or transfer to production 3) managing and deploying models from a variety of ML libraries to a variety of model serving and inference platforms (MLflow Models). By Databricks

Feature Stores

Kubeflow

General K8s

  • Seldon - Seldon Core an open source platform for deploying machine learning models on a Kubernetes cluster.
  • Argo CD
  • Flyte - a structured programming and distributed processing platform created at Lyft that enables highly concurrent, scalable and maintainable workflows for machine learning and data processing.
  • [

Mleap

Spark

Data Versioning & Validation

  • DVC - is built to make ML models shareable and reproducible. It is designed to handle large files, data sets, machine learning models, and metrics as well as code.

DBT

Great Expectations

Other Open Source Tools

Lineage and Metadata Management

Observability

Vendors

Cloud Offerings

Google

Azure