awesome-diarization
github.com/wq2012/awesome-diarization ↗A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Use this list with your AI agent
Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:
"Show me diarization datasets resources from awesome-diarization"
Installation instructions →What's inside
Datasets
- 2000 NIST Speaker Recognition EvaluationDiarization datasets
Disk-6 (Switchboard) , Disk-8 (CALLHOME)
- 2003 NIST Rich Transcription Evaluation DataDiarization datasets
Together with audios
- AudioSetAugmentation noise sources
2M
- BookTubeSpeechSpeaker embedding training sets
8K
- CALLHOME American English SpeechDiarization datasets
CALLHOME American English Transcripts
- CN-CelebSpeaker embedding training sets
130K+
Software
- AaltoASRFramework
Speaker diarization scripts, based on AaltoASR.
- Alize LIA_SpkSegFramework
ALIZÉ is an opensource platform for speaker recognition. LIA_SpkSeg is the tools for speaker diarization.
- asv-subtoolsSpeaker embedding
ASV-Subtools is developed based on Pytorch and Kaldi for the task of speaker recognition, language identification, etc. The 'sub' of 'subtools' means that there are many modular tools and the parts constitute the whole.
- ASVtorchSpeaker embedding
ASVtorch is a toolkit for automatic speaker recognition.
- Auto-Tuning Spectral ClusteringClustering
Auto-tuning Spectral Clustering method that does not need development set or supervised tuning.
- CDEREvaluation
Conversational DER from The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines
Publications
- A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party MeetingsSpecial topics
- AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference ScenarioOther
- An End-to-End Speaker Diarization Service for improving Multimedia Content AccessOther
- An overview of automatic speaker diarization systemsOther
- A Review of Speaker Diarization: Recent Advances with Deep LearningSpecial topics
- A review on speaker diarization systems and approachesSpecial topics
Other learning materials
- A Tutorial on Speaker DiarizationOnline courses
- Fully Supervised Speaker Diarization: Say Goodbye to clusteringVideo tutorials
- Google's Diarization System: Speaker Diarization with LSTMVideo tutorials
- Literature Review For Speaker Change DetectionTech blogs
- pyannote audio: neural building blocks for speaker diarizationVideo tutorials
- Robust Speaker Diarization for Meetings: the ICSI systemVideo tutorials
Showing a sample of 165 resources. View the full list on GitHub →