awesome-machine-learning
github.com/josephmisiti/awesome-machine-learning
A curated list of awesome Machine Learning frameworks, libraries and software.
What's inside
PHP
- 19 Questions [General-Purpose Machine Learning]
Machine learning / Bayesian inference for assigning attributes to objects.
Python
- 2012-paper-diginorm
- acme
An open-source distributed framework for reinforcement learning that makes it easy to build and train your agents.
- A gallery of interesting IPython notebooks
- Aim
- albumentations
A fast, framework-agnostic image augmentation library that implements a diverse set of augmentation techniques. Supports classification, segmentation, and detection out of the box. It was used to win a number of Deep Learning competitions at Kaggle, Topcoder, and the CVPR workshops.
- Allen Downey’s Data Science Course
Code for Data Science at Olin College, Spring 2014.
.NET
- Accord-Framework
- Accord.MachineLearning
Support Vector Machines, Decision Trees, Naive Bayesian models, K-means, Gaussian Mixture models, and general algorithms such as RANSAC, Cross-validation, and Grid-Search for machine-learning applications. This package is part of the Accord.NET Framework.
- Accord.NET
Together with AForge.NET, this library can provide image processing and computer vision algorithms to Windows, Windows RT and Windows Phone. Some components are also available for Java and Android.
- AForge.NET
Open source C# framework for developers and researchers in the fields of Computer Vision and Artificial Intelligence. Development has now shifted to GitHub.
Scala
- adam
A genomics processing engine and specialized file format built using Apache Avro, Apache Spark and Parquet. Apache 2 licensed.
- Algebird
Abstract Algebra for Scala.
- BIDMach
CPU and GPU-accelerated Machine Learning Library.
- BIDMat
CPU and GPU-accelerated matrix library intended to support large-scale exploratory data analysis.
- bioscala
Bioinformatics for the Scala programming language.
- Breeze
Breeze is a numerical processing library for Scala.
Java
- aerosolve
A machine learning library by Airbnb designed from the ground up to be human friendly.
- AMIDST Toolbox
A Java Toolbox for Scalable Probabilistic Machine Learning.
- Apache cTAKES
Apache Clinical Text Analysis and Knowledge Extraction System (cTAKES) is an open-source natural language processing system for information extraction from electronic medical record clinical free-text.
- Chips-n-Salsa
A Java library for genetic algorithms, evolutionary computation, and stochastic local search, with a focus on self-adaptation / self-tuning, as well as parallel execution.
- ClearTK
ClearTK provides a framework for developing statistical natural language processing (NLP) components in Java and is built on top of Apache UIMA.
- CMU Sphinx
Open-source toolkit for speech recognition, based purely on a Java speech recognition library.
Tools
- Agentfield
Open source Kubernetes-style control plane for deploying AI agents as distributed microservices, with built-in service discovery, durable workflows, and observability.
- Agentic Radar
Open-source CLI security scanner for agentic workflows. Scans your workflow’s source code, detects vulnerabilities, and generates an interactive visualization along with a detailed security report. Supports LangGraph, CrewAI, n8n, OpenAI Agents, and more.
- Agentic Signal
Visual AI agent workflow automation platform with local LLM integration. Build intelligent workflows using drag-and-drop, no cloud required.
- Ambrosia
Ambrosia helps you clean up your LLM datasets using
- Aqueduct
Aqueduct enables you to easily define, run, and manage AI & ML tasks on any cloud infrastructure.
- Arize AI
Model validation and performance monitoring, drift detection, explainability, and visualization across structured and unstructured data.
C++
- Agentic Context Engine
- BanditLib
A simple Multi-armed Bandit library.
- BLLIP Parser
BLLIP Natural Language Parser (also known as the Charniak-Johnson parser).
- CRF++
Open source implementation of Conditional Random Fields (CRFs) for segmenting/labeling sequential data & other Natural Language Processing tasks.
- CRFsuite
CRFsuite is an implementation of Conditional Random Fields (CRFs) for labeling sequential data.
- CUDA
A fast C++/CUDA implementation of convolutional neural networks. [DEEP LEARNING]
R
- ahaz
ahaz: Regularization for semiparametric additive hazards regression.
- arules
arules: Mining Association Rules and Frequent Itemsets
- biglasso
biglasso: Extending Lasso Model Fitting to Big Data in R.
- bmrm
bmrm: Bundle Methods for Regularized Risk Minimization Package.
- Boruta
Boruta: A wrapper algorithm for all-relevant feature selection.
- bst
bst: Gradient Boosting.
Showing a sample of 1.2k resources. View the full list on GitHub →