awesome-deep-reasoning

Collect every awesome work about r1!

433

GitHub Stars

109

Curated Resources

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me replicates of deepseek-r1 and deepseek-r1-zero resources from awesome-deep-reasoning"

32B-DeepSeek-R1-ZeroReplicates of DeepSeek-R1 and DeepSeek-R1-Zero
HuggingFace Open R1Replicates of DeepSeek-R1 and DeepSeek-R1-Zero
Logic-RLReplicates of DeepSeek-R1 and DeepSeek-R1-Zero
Reproduce R1 Zero on Logic Puzzle
oatllmReplicates of DeepSeek-R1 and DeepSeek-R1-Zero
Open-R1-MultimodalAdvanced Reasoning for Multi-Modal
A fork to add multimodal model training to open-r1
Open-Reasoner-ZeroReplicates of DeepSeek-R1 and DeepSeek-R1-Zero

3FS
[DeepSeek] A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
DeepGEMM
[DeepSeek] Clean and efficient FP8 GEMM kernels with fine-grained scaling
DualPipe
[DeepSeek] DualPipe achieves full overlap of forward and backward computation-communication phases, also reducing pipeline bubbles.
https://github.com/deepseek-ai/FlashMLA
https://github.com/hiyouga/EasyR1
https://github.com/huggingface/Math-Verify

AIME-2024
This dataset contains problems from the American Invitational Mathematics Examination (AIME) 2024.
AIME-VALIDATION
All 90 problems come from AIME 22, AIME 23, and AIME 24
aimo-validation-amc
All 83 samples come from AMC12 2022, AMC12 2023
Best practice for evaluating R1/o1-like reasoning models
Codeforces-Python-Submissions
A dataset of Python submissions from Codeforces.
GPQA-Diamond
Diamond subset from GPQA benchmark.

BAAI-TACO
TACO is a benchmark for code generation with 26443 problems.
Bespoke-Stratos-17k
A reasoning dataset of questions, reasoning traces, and answers.
clevr_cogen_a_train
A R1-distilled visual reasoning dataset.
Clevr_CoGenT_TrainA_R1
A multi-modal dataset for training MM R1 model.
HuggingFace
HuggingFace
800k samples dataset to train DeepSeek-R1 Distill models.

Showing a sample of 109 resources. View the full list on GitHub →