awesome-visual-question-answering
github.com/jokieleung/awesome-visual-question-answering ↗A curated list of Visual Question Answering(VQA)(Image/Video Question Answering),Visual Question Generation ,Visual Dialog ,Visual Commonsense Reasoning and related area.
672
GitHub Stars
226
Curated Resources
3
Categories
3 hours ago
Last Refreshed
PapersVQA Challenge LeaderboardReference and Acknowledgement
Use this list with your AI agent
Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:
"Show me 2018 resources from awesome-visual-question-answering"
Installation instructions →What's inside
Papers
- A Better Way to Attend: Attention With Trees for Video Question Answering2018
Hongyang Xue et al,
- A Case Study of the Shortcut Effects in Visual Commonsense Reasoning2021
Keren Ye et al,
- A Joint Sequence Fusion Model for Video Question Answering and Retrieval2018
Youngjae Yu et al,
- An Analysis of Visual Question Answering Algorithms2017-2015
Kushal Kafle et al,
- A negative case analysis of visual grounding methods for VQA2020
Robik Shrestha et al,
- Answering Questions about Data Visualizations using Efficient Bimodal Fusion2020
Kushal Kafle et al,
Reference and Acknowledgement
Showing a sample of 226 resources. View the full list on GitHub →