awesome-mcot

github.com/yaotingwangofficial/awesome-mcot ↗

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

GitHub Stars

236

Curated Resources

Categories

20 hours ago

Last Refreshed

Tab-1: Datasets for MCoT Training with Rationale.Tab-2: Benchmarks for MCoT Evaluation without Rationale.Tab-3: Benchmarks for MCoT Evaluation with Rationale.MCoT Reasoning Over ImageMCoT Reasoning Over VideoMCoT Reasoning Over 3DMCoT Reasoning Over Audio and SpeechMCoT Reasoning Over Table and ChartCross-modal CoT ReasoningRationale ConstructionStructural ReasoningInformation EnhancingObjective GranularityMultimodal RationaleTest-time ScalingEmbodied AIAgentic SystemAutonomous DrivingMedical and HealthcareSocial and HumanMultimodal Generation

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me survey resources from awesome-mcot"

Installation instructions →

What's inside

Multimodal Generation

MCoT Reasoning Over 3D

Tab-2: Benchmarks for MCoT Evaluation without Rationale.

Embodied AI

MCoT Reasoning Over Video

MCoT Reasoning Over Image

Tab-1: Datasets for MCoT Training with Rationale.

A-OKVQA
2022
EgoCoT
2023
EMMA-X
2024
LLaVA-CoT-100k
2024
M3CoT
2024
MAmmoTH-VL
2024

Rationale Construction

Showing a sample of 236 resources. View the full list on GitHub →