Skip to main content

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1.9k
GitHub Stars
165
Curated Resources
4
Categories
7 hours ago
Last Refreshed
PublicationsSoftwareDatasetsOther learning materials

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me diarization datasets resources from awesome-diarization"

Installation instructions →

What's inside

Datasets

Software

  • AaltoASRFramework

    Speaker diarization scripts, based on AaltoASR.

  • Alize LIA_SpkSegFramework

    ALIZÉ is an opensource platform for speaker recognition. LIA_SpkSeg is the tools for speaker diarization.

  • asv-subtoolsSpeaker embedding

    ASV-Subtools is developed based on Pytorch and Kaldi for the task of speaker recognition, language identification, etc. The 'sub' of 'subtools' means that there are many modular tools and the parts constitute the whole.

  • ASVtorchSpeaker embedding

    ASVtorch is a toolkit for automatic speaker recognition.

  • Auto-Tuning Spectral ClusteringClustering

    Auto-tuning Spectral Clustering method that does not need development set or supervised tuning.

  • CDEREvaluation

    Conversational DER from The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines

Showing a sample of 165 resources. View the full list on GitHub →