awesome-multimodal-convai
github.com/holylovenia/awesome-multimodal-convai ↗Paper reading list for Multimodal Conversational AI
4
GitHub Stars
77
Curated Resources
2
Categories
22 hours ago
Last Refreshed
:bookmark_tabs: Research Papers:bookmark: Articles, Tutorials, and Presentations
Use this list with your AI agent
Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:
"Show me :face_in_clouds: convai tasks resources from awesome-multimodal-convai"
Installation instructions →What's inside
:bookmark_tabs: Research Papers
- A Knowledge-Grounded Multimodal Search-Based Conversational Agent:face_in_clouds: ConvAI Tasks
- Assessing Multilingual Fairness in Pre-trained Multimodal RepresentationsAnalysis
- A survey on deep learning for multimodal data fusion:monocle_face: Multimodal Machine Learning
- Bridging Text and Video: A Universal Multimodal Transformer for Audio-Visual Scene-Aware Dialog:face_in_clouds: ConvAI Tasks
- ChiCo: A Multimodal Corpus for the Study of Child Conversation:card_file_box: Dataset and Challenges
- CLEVR-Dialog: A Diagnostic Dataset for Multi-Round Reasoning in Visual Dialog:card_file_box: Dataset and Challenges
:bookmark: Articles, Tutorials, and Presentations
- Guest Editorial: Image and Language Understanding
- Multimodal Learning and Reasoning
- Multimodal Machine Learning
- Multimodal Machine Learning: Integrating Language, Vision and Speech
- Neural Approaches to Conversational AI
- Towards Multimodal Human-Like Characteristics and Expressive Visual Prosody in Virtual Agents
Showing a sample of 77 resources. View the full list on GitHub →