fucking-awesome-machine-learning
github.com/correia-jpv/fucking-awesome-machine-learning ↗A curated list of awesome Machine Learning frameworks, libraries and software. With repository stars⭐ and forks🍴
Use this list with your AI agent
Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:
"Show me general-purpose machine learning resources from fucking-awesome-machine-learning"
Installation instructions →What's inside
PHP
- 19 QuestionsGeneral-Purpose Machine Learning
A machine learning / bayesian inference assigning attributes to objects.
Python
- 2012-paper-diginorm
- acme
An Open Source Distributed Framework for Reinforcement Learning that makes build and train your agents easily.
- A gallery of interesting IPython notebooks
- Aim
- albumentations
А fast and framework agnostic image augmentation library that implements a diverse set of augmentation techniques. Supports classification, segmentation, detection out of the box. Was used to win a number of Deep Learning competitions at Kaggle, Topcoder and those that were a part of the CVPR workshops.
- Allen Downey’s Data Science Course
Code for Data Science at Olin College, Spring 2014.
.NET
- Accord-Framework
- Accord.MachineLearning
Support Vector Machines, Decision Trees, Naive Bayesian models, K-means, Gaussian Mixture models and general algorithms such as Ransac, Cross-validation and Grid-Search for machine-learning applications. This package is part of the Accord.NET Framework.
- Accord.NET
Together with AForge.NET, this library can provide image processing and computer vision algorithms to Windows, Windows RT and Windows Phone. Some components are also available for Java and Android.
- AForge.NET
Open source C# framework for developers and researchers in the fields of Computer Vision and Artificial Intelligence. Development has now shifted to GitHub.
Scala
- adam
A genomics processing engine and specialized file format built using Apache Avro, Apache Spark and Parquet. Apache 2 licensed.
- Algebird
Abstract Algebra for Scala.
- BIDMach
CPU and GPU-accelerated Machine Learning Library.
- BIDMat
CPU and GPU-accelerated matrix library intended to support large-scale exploratory data analysis.
- bioscala
Bioinformatics for the Scala programming language
- Breeze
Breeze is a numerical processing library for Scala.
Java
- aerosolve
A machine learning library by Airbnb designed from the ground up to be human friendly.
- AMIDST Toolbox
A Java Toolbox for Scalable Probabilistic Machine Learning.
- Apache cTAKES
Apache Clinical Text Analysis and Knowledge Extraction System (cTAKES) is an open-source natural language processing system for information extraction from electronic medical record clinical free-text.
- Chips-n-Salsa
A Java library for genetic algorithms, evolutionary computation, and stochastic local search, with a focus on self-adaptation / self-tuning, as well as parallel execution.
- ClearTK
ClearTK provides a framework for developing statistical natural language processing (NLP) components in Java and is built on top of Apache UIMA.
- CMU Sphinx
Open Source Toolkit For Speech Recognition purely based on Java speech recognition library.
Tools
- Agentfield
Open source Kubernetes-style control plane for deploying AI agents as distributed microservices, with built-in service discovery, durable workflows, and observability.
- Agentic Radar
Open-source CLI security scanner for agentic workflows. Scans your workflow’s source code, detects vulnerabilities, and generates an interactive visualization along with a detailed security report. Supports LangGraph, CrewAI, n8n, OpenAI Agents, and more.
- Agentic Signal
Visual AI agent workflow automation platform with local LLM integration. Build intelligent workflows using drag-and-drop, no cloud required.
- Ambrosia
Ambrosia helps you clean up your LLM datasets using
- Aqueduct
Aqueduct enables you to easily define, run, and manage AI & ML tasks on any cloud infrastructure.
- Arize AI
Model validation and performance monitoring, drift detection, explainability, visualization across structured and unstructured data
C++
- Agentic Context Engine
- BanditLib
A simple Multi-armed Bandit library.
- BLLIP Parser
BLLIP Natural Language Parser (also known as the Charniak-Johnson parser).
- CRF++
Open source implementation of Conditional Random Fields (CRFs) for segmenting/labeling sequential data & other Natural Language Processing tasks.
- CRFsuite
CRFsuite is an implementation of Conditional Random Fields (CRFs) for labeling sequential data.
- CUDA
This is a fast C++/CUDA implementation of convolutional [DEEP LEARNING]
R
- ahaz
ahaz: Regularization for semiparametric additive hazards regression.
- arules
arules: Mining Association Rules and Frequent Itemsets
- biglasso
biglasso: Extending Lasso Model Fitting to Big Data in R.
- bmrm
bmrm: Bundle Methods for Regularized Risk Minimization Package.
- Boruta
Boruta: A wrapper algorithm for all-relevant feature selection.
- bst
bst: Gradient Boosting.
Showing a sample of 1.2k resources. View the full list on GitHub →