awesome-video-llms
github.com/zyayoung/awesome-video-llms βExplore VLM-Eval, a framework for evaluating Video Large Language Models, enhancing your video analysis with cutting-edge AI technology.
36
GitHub Stars
27
Curated Resources
4
Categories
1 hour ago
Last Refreshed
Awesome Video Large Language Models π₯π¦Datasets πΎπResultsJoin the Awesome Video Large Language Models Community ππ€
Use this list with your AI agent
Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:
"Show me join the awesome video large language models community ππ€ resources from awesome-video-llms"
Installation instructions βWhat's inside
Join the Awesome Video Large Language Models Community ππ€
Datasets πΎπ
- DiDemo link
Video captioning, temporal localization
- HMDB51 link
Action recognition
- Kinetics-400 link
Action recognition
- MSRVTT link
Video QA, video captioning
- MSVD link
Video QA, video captioning
- NExT-QA link
Video QA
Results
- LLaMA-VID
50.1
- Video-LLaMA
32.2
- Video-LLaVA (Lin et al.)
48.0
Showing a sample of 27 resources. View the full list on GitHub β