Skip to main content

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

1.5k
GitHub Stars
117
Curated Resources
5
Categories
23 hours ago
Last Refreshed
🔔 News🛠️ Stage 1: Tool-Driven Visual Exploration💻 Stage 2: Programmatic Visual Manipulation🎨 Stage 3: Intrinsic Visual Imagination📊 Evaluation & Benchmarks

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me ➤ benchmarks for thinking with images resources from awesome_think_with_images"

Installation instructions →

What's inside

Showing a sample of 117 resources. View the full list on GitHub →