Skip to main content

Probably the best curated list of data science software in Python.

3.5k
GitHub Stars
384
Curated Resources
29
Categories
1 hour ago
Last Refreshed
Machine LearningDeep LearningAutomated Machine LearningNatural Language ProcessingComputer AuditionComputer VisionTime SeriesReinforcement LearningGraph Machine LearningLearning-to-Rank & Recommender SystemsProbabilistic Graphical ModelsProbabilistic MethodsModel ExplanationGenetic ProgrammingOptimizationFeature EngineeringVisualizationDeploymentStatisticsData ManipulationDistributed ComputingExperimentationData ValidationEvaluationComputationsWeb ScrapingSpatial AnalysisQuantum ComputingConversion

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me interactive plots resources from awesome-python-data-science"

Installation instructions →

What's inside

Reinforcement Learning

  • Acme

    A library of reinforcement learning components and agents.

  • Catalyst-RL

    PyTorch framework for RL research.

  • cleanrl

    High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG).

  • d3rlpy

    An offline deep reinforcement learning library.

  • DI-engine

    OpenDILab Decision AI Engine.

  • Dopamine

    A research framework for fast prototyping of reinforcement learning algorithms.

Computations

  • adaptive

    Tools for adaptive and parallel samping of mathematical functions.

  • bottleneck

    Fast NumPy array functions written in C.

  • CuPy

    NumPy-like API accelerated with CUDA.

  • Dask

    Parallel computing with task scheduling.

  • numdifftools

    Solve automatic numerical differentiation problems in one or more variables.

  • NumExpr

    A fast numerical expression evaluator for NumPy that comes with an integrated computing virtual machine to speed calculations up by avoiding memory allocation for intermediate results.

Model Explanation

  • aequitas

    Bias and Fairness Audit Toolkit.

  • AI Explainability 360

    Interpretability and explainability of data and machine learning models.

  • Alibi

    Algorithms for monitoring and explaining machine learning models.

  • anchor

    Code for "High-Precision Model-Agnostic Explanations" paper.

  • Auralisation

    Auralisation of learned features in CNN (for audio).

  • CapsNet-Visualization

    A visualization of the CapsNet layers to better understand how it works.

Evaluation

  • AI Fairness 360

    Fairness metrics for datasets and ML models, explanations, and algorithms to mitigate bias in datasets and models.

  • alibi-detect

    Algorithms for outlier, adversarial and drift detection.

  • Metrics

    Machine learning evaluation metric.

Computer Vision

  • albumentations

    Fast image augmentation library and easy-to-use wrapper around other libraries.

  • Augmentor

    Image augmentation library in Python for machine learning.

  • Decord

    An efficient video loader for deep learning with smart shuffling that's super easy to digest.

  • imgaug

    Image augmentation for machine learning experiments.

  • imgaug_extension

    Additional augmentations for imgaug.

  • KerasCV

    Industry-strength Computer Vision workflows with Keras.

Learning-to-Rank & Recommender Systems

  • allRank

    allRank is a framework for training learning-to-rank neural models based on PyTorch.

  • LightFM

    A Python implementation of LightFM, a hybrid recommendation algorithm.

Statistics

  • Alphalens

    Performance analysis of predictive (alpha) stock factors.

Visualization

  • AltairInteractive plots

    Declarative statistical visualization library for Python. Can easily do many data transformation within the code to create graph

  • animatplotInteractive plots

    A python package for animating plots built on matplotlib.

  • AutoVizAutomatic Plotting

  • BokehInteractive plots

    Interactive Web Plotting for Python.

  • bqplotInteractive plots

    Plotting library for IPython/Jupyter notebooks

  • chartifyGeneral Purposes

    Python library that makes it easy for data scientists to create charts.

Showing a sample of 384 resources. View the full list on GitHub →