Skip to main content

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

732
GitHub Stars
22
Curated Resources
2
Categories
17 min ago
Last Refreshed
Large Audio Models in MusicAudio Datasets

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me audio datasets resources from awesome-large-audio-models"

Installation instructions →

What's inside

Audio Datasets

  • Audiocaps

    Audiocaps: Generating captions for audios in the wild

  • Audio set

    Audio set: An ontology and human-labeled dataset for audio events

  • Clotho

    Clotho: An audio captioning dataset

  • CommonVoice 11

    CommonVoice: A Massively Multilingual Speech Corpus

  • CoVoST

    CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus

  • CVSS

    CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus

Large Audio Models in Music

Showing a sample of 22 resources. View the full list on GitHub →