Skip to main content

Awesome machine learning model compression research papers, quantization, tools, and learning material.

544
GitHub Stars
116
Curated Resources
4
Categories
17 hours ago
Last Refreshed
PapersArticlesToolsLicense

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me quantization resources from awesome-ml-model-compression"

Installation instructions →

What's inside

Papers

Articles

Tools

  • BitsandbytesLibraries

  • facebookresearch/kill-the-bitsPaper Implementations

    code and compressed models for the paper, "And the bit goes down: Revisiting the quantization of neural networks" by Facebook AI Research.

  • NNCPLibraries

    An experiment to build a practical lossless data compressor with neural networks. The latest version uses a Transformer model (slower but best ratio). LSTM (faster) is also available.

  • TensorFlow Model Optimization ToolkitLibraries

  • XNNPACKLibraries

Showing a sample of 116 resources. View the full list on GitHub →