awesome-multi-modal
github.com/zipzou/awesome-multi-modal ↗The paper list in the multimodal domain.
0
GitHub Stars
63
Curated Resources
4
Categories
18 hours ago
Last Refreshed
Textual Large Language Model BackboneVision Model BackboneVision LLM for GenerationImage Generation
Use this list with your AI agent
Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:
"Show me gan paradigm resources from awesome-multi-modal"
Installation instructions →What's inside
Image Generation
- https://arxiv.org/abs/1406.2661GAN Paradigm
- https://arxiv.org/abs/1812.04948GAN Paradigm
- https://arxiv.org/abs/2006.11239Diffusion Paradigm
- https://arxiv.org/abs/2105.13290Augoregressive or MLM Paradigm in Discrete Space
- https://arxiv.org/abs/2110.04627GAN Paradigm
- https://arxiv.org/abs/2112.10752Diffusion Paradigm
Vision Model Backbone
Textual Large Language Model Backbone
Showing a sample of 63 resources. View the full list on GitHub →