awesome-visual-question-answering

github.com/jokieleung/awesome-visual-question-answering ↗

A curated list of Visual Question Answering(VQA)(Image/Video Question Answering),Visual Question Generation ,Visual Dialog ,Visual Commonsense Reasoning and related area.

672

GitHub Stars

226

Curated Resources

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me 2018 resources from awesome-visual-question-answering"

Installation instructions →

What's inside

Papers

A Better Way to Attend: Attention With Trees for Video Question Answering2018
Hongyang Xue et al,
A Case Study of the Shortcut Effects in Visual Commonsense Reasoning2021
Keren Ye et al,
A Joint Sequence Fusion Model for Video Question Answering and Retrieval2018
Youngjae Yu et al,
An Analysis of Visual Question Answering Algorithms2017-2015
Kushal Kafle et al,
A negative case analysis of visual grounding methods for VQA2020
Robik Shrestha et al,
Answering Questions about Data Visualizations using Efficient Bimodal Fusion2020
Kushal Kafle et al,

Reference and Acknowledgement

Showing a sample of 226 resources. View the full list on GitHub →