Skip to main content

Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation

1.7k
GitHub Stars
235
Curated Resources
7
Categories
55 min ago
Last Refreshed
PapersBooksTalks and TutorialsSoftwareBenchmarks and DatasetsConferencesJournals

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me memory/cache modeling/analysis resources from awesome-machine-learning-in-compilers"

Installation instructions →

What's inside

Papers

Benchmarks and Datasets

  • ANGHABENCH

    A suite with One Million Compilable C Benchmarks (

  • BHive

    A Benchmark Suite and Measurement Framework for Validating x86-64 Basic Block Performance Models (

  • cBench

    32 C benchmarks with datasets and driver scripts.

  • CodeXGLUE

    A Machine Learning Benchmark Dataset for Code Understanding and Generation (

  • DeepDataFlow

    469k LLVM-IR files and 8.6B data-flow analysis labels for classification (

  • devmap

    650 OpenCL benchmark features and CPU/GPU classification labels (

Books

Software

  • clgen

    Benchmark generator using LSTMs (

  • COBAYN

    Compiler Autotuning using BNs (

  • CodeBert

    pre-trained DNN models for programming languages (

  • CompilerGym

    Reinforcement learning environments for compiler optimizations (

  • IR2Vec

    LLVM IR based program embeddings for machine learning (

  • IREE

    A retargetable MLIR-based machine learning compiler and runtime toolkit.

Showing a sample of 235 resources. View the full list on GitHub →