awesome-speech-pretraining
github.com/ddlbojack/awesome-speech-pretraining
Paper, Code and Statistics for Self-Supervised Learning and Pre-Training on Speech.
GitHub Stars: 211
Curated Resources: 73
Categories: 1 (Papers)
Last Refreshed: 23 hours ago
What's inside
Papers
- A general multi-task learning framework to leverage text data for speech to text tasks (Speech + Text)
- An Unsupervised Autoregressive Model for Speech Representation Learning (2019)
- BEATs: Audio Pre-Training with Acoustic Tokenizers (SSL for Audio)
- BigSSL: Exploring the frontier of large-scale semi-supervised learning for automatic speech recognition (2021)
- BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation (SSL for Audio)
- CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning (2022)
Showing a sample of 73 resources. View the full list on GitHub.