awesome-large-audio-models
github.com/emulationai/awesome-large-audio-models ↗Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
732
GitHub Stars
22
Curated Resources
2
Categories
17 min ago
Last Refreshed
Large Audio Models in MusicAudio Datasets
Use this list with your AI agent
Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:
"Show me audio datasets resources from awesome-large-audio-models"
Installation instructions →What's inside
Audio Datasets
- Audiocaps
Audiocaps: Generating captions for audios in the wild
- Audio set
Audio set: An ontology and human-labeled dataset for audio events
- Clotho
Clotho: An audio captioning dataset
- CommonVoice 11
CommonVoice: A Massively Multilingual Speech Corpus
- CoVoST
CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus
- CVSS
CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus
Large Audio Models in Music
- voicetoinstrument.com
Convert voice to instrumental tracks using AI
Showing a sample of 22 resources. View the full list on GitHub →