
A curated list of awesome Machine Learning frameworks, libraries and software.

73k GitHub Stars
1.2k Curated Resources
38 Categories
Last refreshed 1 hour ago
IMPORTANT NOTE ON PRs, APL, C, C++, Common Lisp, Clojure, Crystal, CUDA PTX, Elixir, Erlang, Fortran, Go, Haskell, Java, JavaScript, Julia, Kotlin, Lua, Matlab, .NET, Objective C, OCaml, OpenCV, Perl, Perl 6, PHP, Python, Ruby, Rust, R, SAS, Scala, Scheme, Swift, TensorFlow, Tools, Books, Credits

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me general-purpose machine learning resources from awesome-machine-learning"

Installation instructions →

What's inside

PHP

  • 19 Questions (General-Purpose Machine Learning)

    Machine learning / Bayesian inference for assigning attributes to objects.

Python

.NET

  • Accord-Framework

  • Accord.MachineLearning

    Support Vector Machines, Decision Trees, Naive Bayes models, K-means, Gaussian Mixture models, and general algorithms such as RANSAC, cross-validation, and grid search for machine learning applications. This package is part of the Accord.NET Framework.

  • Accord.NET

    Together with AForge.NET, this library can provide image processing and computer vision algorithms to Windows, Windows RT and Windows Phone. Some components are also available for Java and Android.

  • AForge.NET

    Open source C# framework for developers and researchers in the fields of Computer Vision and Artificial Intelligence. Development has now shifted to GitHub.

Scala

  • adam

    A genomics processing engine and specialized file format built using Apache Avro, Apache Spark and Parquet. Apache 2 licensed.

  • Algebird

    Abstract Algebra for Scala.

  • BIDMach

    CPU and GPU-accelerated Machine Learning Library.

  • BIDMat

    CPU and GPU-accelerated matrix library intended to support large-scale exploratory data analysis.

  • bioscala

    Bioinformatics for the Scala programming language.

  • Breeze

    Breeze is a numerical processing library for Scala.

Java

  • aerosolve

    A machine learning library by Airbnb designed from the ground up to be human friendly.

  • AMIDST Toolbox

    A Java Toolbox for Scalable Probabilistic Machine Learning.

  • Apache cTAKES

    Apache Clinical Text Analysis and Knowledge Extraction System (cTAKES) is an open-source natural language processing system for information extraction from electronic medical record clinical free-text.

  • Chips-n-Salsa

    A Java library for genetic algorithms, evolutionary computation, and stochastic local search, with a focus on self-adaptation / self-tuning, as well as parallel execution.

  • ClearTK

    ClearTK provides a framework for developing statistical natural language processing (NLP) components in Java and is built on top of Apache UIMA.

  • CMU Sphinx

    Open-source toolkit for speech recognition, built on a pure-Java speech recognition library.

Tools

  • Agentfield

    Open-source, Kubernetes-style control plane for deploying AI agents as distributed microservices, with built-in service discovery, durable workflows, and observability.

  • Agentic Radar

    Open-source CLI security scanner for agentic workflows. Scans your workflow’s source code, detects vulnerabilities, and generates an interactive visualization along with a detailed security report. Supports LangGraph, CrewAI, n8n, OpenAI Agents, and more.

  • Agentic Signal

    Visual AI agent workflow automation platform with local LLM integration. Build intelligent workflows using drag-and-drop, no cloud required.

  • Ambrosia

    Ambrosia helps you clean up your LLM datasets using

  • Aqueduct

    Aqueduct enables you to easily define, run, and manage AI & ML tasks on any cloud infrastructure.

  • Arize AI

    Model validation, performance monitoring, drift detection, explainability, and visualization across structured and unstructured data.

C++

  • Agentic Context Engine

  • BanditLib

    A simple Multi-armed Bandit library.

  • BLLIP Parser

    BLLIP Natural Language Parser (also known as the Charniak-Johnson parser).

  • CRF++

    Open source implementation of Conditional Random Fields (CRFs) for segmenting/labeling sequential data & other Natural Language Processing tasks.

  • CRFsuite

    CRFsuite is an implementation of Conditional Random Fields (CRFs) for labeling sequential data.

  • CUDA

    This is a fast C++/CUDA implementation of convolutional [DEEP LEARNING]

R

  • ahaz

    ahaz: Regularization for semiparametric additive hazards regression.

  • arules

    arules: Mining Association Rules and Frequent Itemsets.

  • biglasso

    biglasso: Extending Lasso Model Fitting to Big Data in R.

  • bmrm

    bmrm: Bundle Methods for Regularized Risk Minimization Package.

  • Boruta

    Boruta: A wrapper algorithm for all-relevant feature selection.

  • bst

    bst: Gradient Boosting.

Showing a sample of 1.2k resources. View the full list on GitHub →