Context Awesome

awesome-multi-modal

github.com/zipzou/awesome-multi-modal

A curated list of papers in the multimodal domain.

GitHub Stars: 0

Curated Resources: 63

Categories: 4

Last Refreshed: 13 hours ago

Textual Large Language Model Backbone
Vision Model Backbone
Vision LLM for Generation
Image Generation

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me GAN Paradigm resources from awesome-multi-modal"

What's inside

Image Generation

https://arxiv.org/abs/1406.2661 (GAN Paradigm)
https://arxiv.org/abs/1812.04948 (GAN Paradigm)
https://arxiv.org/abs/2006.11239 (Diffusion Paradigm)
https://arxiv.org/abs/2105.13290 (Autoregressive or MLM Paradigm in Discrete Space)
https://arxiv.org/abs/2110.04627 (GAN Paradigm)
https://arxiv.org/abs/2112.10752 (Diffusion Paradigm)

Vision Model Backbone

Textual Large Language Model Backbone

Vision LLM for Generation

Showing a sample of the 63 resources. View the full list on GitHub: github.com/zipzou/awesome-multi-modal