awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1.9k

GitHub Stars

165

Curated Resources

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me diarization datasets resources from awesome-diarization"

2000 NIST Speaker Recognition EvaluationDiarization datasets
Disk-6 (Switchboard) , Disk-8 (CALLHOME)
2003 NIST Rich Transcription Evaluation DataDiarization datasets
Together with audios
AudioSetAugmentation noise sources
2M
BookTubeSpeechSpeaker embedding training sets
8K
CALLHOME American English SpeechDiarization datasets
CALLHOME American English Transcripts
CN-CelebSpeaker embedding training sets
130K+

AaltoASRFramework
Speaker diarization scripts, based on AaltoASR.
Alize LIA_SpkSegFramework
ALIZÉ is an opensource platform for speaker recognition. LIA_SpkSeg is the tools for speaker diarization.
asv-subtoolsSpeaker embedding
ASV-Subtools is developed based on Pytorch and Kaldi for the task of speaker recognition, language identification, etc. The 'sub' of 'subtools' means that there are many modular tools and the parts constitute the whole.
ASVtorchSpeaker embedding
ASVtorch is a toolkit for automatic speaker recognition.
Auto-Tuning Spectral ClusteringClustering
Auto-tuning Spectral Clustering method that does not need development set or supervised tuning.
CDEREvaluation
Conversational DER from The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines

Showing a sample of 165 resources. View the full list on GitHub →