awesome-mlops
github.com/kelvins/awesome-mlops ↗:sunglasses: A curated list of awesome MLOps tools
Use this list with your AI agent
Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:
"Show me optimization tools resources from awesome-mlops"
Installation instructions →What's inside
Optimization Tools
- Accelerate
A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision.
- Dask
Provides advanced parallelism for analytics, enabling performance at scale for the tools you love.
- DeepSpeed
Deep learning optimization library that makes distributed training easy, efficient, and effective.
- Fiber
Python distributed computing library for modern computer clusters.
- Horovod
Distributed deep learning training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
- Mahout
Distributed linear algebra framework and mathematically expressive Scala DSL.
Hyperparameter Tuning
- Advisor
Open-source implementation of Google Vizier for hyper parameters tuning.
- Hyperas
A very simple wrapper for convenient hyperparameter optimization.
- Hyperopt
Distributed Asynchronous Hyperparameter Optimization in Python.
- Katib
Kubernetes-based system for hyperparameter tuning and neural architecture search.
- KerasTuner
Easy-to-use, scalable hyperparameter optimization framework.
Model Lifecycle
- Aeromancy
A framework for performing reproducible AI and ML for Weights and Biases.
- Aim
A super-easy way to record, search and compare 1000s of ML training runs.
- Cascade
Library of ML-Engineering tools for rapid prototyping and experiment management.
- Comet
Track your datasets, code changes, experimentation history, and models.
- Guild AI
Open source experiment tracking, pipeline automation, and hyperparameter tuning.
- Keepsake
Version control for machine learning with support to Amazon S3 and Google Cloud Storage.
Websites
Events
- AI Conference Deadline
- MLOps Conference - Keynotes and Panels
Keynotes and Panels
Model Fairness and Privacy
Books
Data Processing
- Airflow
Platform to programmatically author, schedule, and monitor workflows.
- Azkaban
Batch workflow job scheduler created at LinkedIn to run Hadoop jobs.
- Dagster
A data orchestrator for machine learning, analytics, and ETL.
- Hadoop
Framework that allows for the distributed processing of large data sets across clusters.
Showing a sample of 314 resources. View the full list on GitHub →