awesome-efficient-cot-reasoning-summary
github.com/zwxandy/awesome-efficient-cot-reasoning-summary ↗🔥 How to efficiently and effectively compress the CoTs or directly generate concise CoTs during inference while maintaining the reasoning performance is an important topic!
Use this list with your AI agent
Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:
"Show me prompting-guided cot compression resources from awesome-efficient-cot-reasoning-summary"
Installation instructions →What's inside
Prompting-guided CoT Compression
- Break the Chain: Large Language Models Can be Shortcut Reasoners
arXiv 2024.6.4
- Chain of Draft: Thinking Faster by Writing Less
arXiv 2025.2.25
- Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation
ICLR 2024
- Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching
arXiv 2025.3.7
- Syzygy of Thoughts: Improving LLM CoT with the Minimal Free Resolution
arXiv 2025.4.13
Training-internized CoT Compression
- C3oT: Generating Shorter Chain-of-Thought without Compromising Effectiveness
AAAI 2025
- Can Language Models Learn to Skip Steps?
NeurIPS 2024
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning
arXiv 2025.2.13
- LightThinker: Thinking Step-by-Step Compression
arXiv 2025.2.21
- O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning
arXiv 2025.1.29
- Self-Training Elicits Concise Reasoning in Large Language Models
arXiv 2025.2.28
Chain of Model
- Chain-of-Model Learning for Language Model
arXiv 2025.5.17
Latent-space CoT Reasoning
- CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation
arXiv 2025.2.28
- Enhancing Auto-regressive Chain-of-Thought through Loop-Aligned Reasoning
arXiv 2025.2.12
- Reasoning with Latent Thoughts: On the Power of Looped Transformers
ICLR 2025
- SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs
arXiv 2025.2.17
Inference-time CoT Compression
- Dynamic Early Exit in Reasoning Models
arXiv 2025.4.22
- Efficient Long-Decoding Inference with Reasoning-Aware Attention Sparsity
arXiv 2025.2.16
- Entropy-based Exploration Conduction for Multi-step Reasoning
arXiv 2025.3.20
- $\phi$-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation
arXiv 2025.3.17
Recent Survey
Multimodal CoT Reasoning
- Efficient Reasoning with Hidden Thinking
arXiv 2025.1.31
- GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing
arXiv 2025.3.13
- Image-of-Thought Prompting for Visual Reasoning Refinement in Multimodal Large Language Models
arXiv 2024.5.29
- Imagine while Reasoning in Space: Multimodal Visualization-of-Thought
arXiv 2025.1.13
- Interleaved-Modal Chain-of-Thought
arXiv 2025.3.17
- Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering
NeurIPS 2022
Analysis of CoT Compression
Showing a sample of 38 resources. View the full list on GitHub →