
A list of papers in the multimodal domain.

GitHub Stars: 0
Curated Resources: 63
Categories: 4
Last Refreshed: 18 hours ago
Textual Large Language Model Backbone, Vision Model Backbone, Vision LLM for Generation, Image Generation

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me GAN paradigm resources from awesome-multi-modal"

Installation instructions →

What's inside

Showing a sample of 63 resources. View the full list on GitHub →