awesome-speech-language-model
github.com/ddlbojack/awesome-speech-language-model ↗Paper, Code and Resources for Speech Language Model and End2End Speech Dialogue System.
198
GitHub Stars
44
Curated Resources
4
Categories
23 hours ago
Last Refreshed
Universal Speech, Audio and Music UnderstandingEnd2End Speech Dialogue SystemFull Duplex ModelingSurvey
Use this list with your AI agent
Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:
"Show me benchmark resources from awesome-speech-language-model"
Installation instructions →What's inside
Full Duplex Modeling
- A Full-duplex Speech Dialogue Scheme Based On Large Language Models
- Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models
- Beyond Turn-Based Interfaces: Synchronous LLMs as Full-Duplex Dialogue Agents
- Enabling Real-Time Conversations with Minimal Training Costs
- Language Model Can Listen While Speaking
Universal Speech, Audio and Music Understanding
- AIR-Bench: Benchmarking Large Audio-Language Models via Generative ComprehensionBenchmark
- A Suite for Acoustic Language Model EvaluationBenchmark
- AudioBench: A Universal Benchmark for Audio Large Language ModelsBenchmark
- Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue AbilitiesModel
- Distilling an End-to-End Voice Assistant Without Instruction Training DataModel
- Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 TasksBenchmark
Survey
End2End Speech Dialogue System
Showing a sample of 44 resources. View the full list on GitHub →