Skip to main content

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

538
GitHub Stars
33
Curated Resources
7
Categories
16 hours ago
Last Refreshed
Good resources for beginners:Good resources for more complex stuff:Good UtilsGood Kaldi "production ready" examplesAvailable pretrained models:Resources for understanding the math/science behind Kaldi better:Important articles

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me resources for understanding the math/science behind kaldi better: resources from awesome-kaldi"

Installation instructions →

What's inside

Resources for understanding the math/science behind Kaldi better:

Important articles

Good resources for beginners:

Good Utils

  • combine_data.sh

    If you have multiple datasets and you want to combine all of the manually, there is no need to do it file after file. this script will take an entire data directory and will combine all the files into the same new directory.

  • Finetune acoustic model

    If you don't have a lot of data You can always train a Kaldi model from the closest domain to your domain and then take the

  • Kaldi-ONNX project by XiaoMi

    A project that helps transferring the Kaldi model into ONNX so you could easily use the model in different frameworks.

  • perturb_data_dir_speed_3way.sh

    this script will help you to change the speaking speed of different utterances without creating excess files. It does this by implementing an SoX command to your wav file and copying and editing all the other files in your folder. Using this script and also the next one is a must-have in most state-of-the-art systems and will help your model to generalize better.

  • perturb_data_dir_volume.sh

    this script will do exactly the same but will change the volume of the utterances.

  • resample_data_dir.sh

    You want to make a new model for different sampling rate but you don't want to manually re-sample you entire data? this script will help you to do it, again with a SoX command.

Good Kaldi "production ready" examples

  • compile Kaldi for android

    You can also compile the Kaldi project in a way that will work directly on android devices. That might not be a good idea with a heavy model, but can be used to more constrained models.

  • kaldi-adapt-lm

    A tool that helps to adapt nnet3 chain models to a different language model.

  • kaldi-gstreamer-server

    this is a nice project that will help you to integrate Kaldi toolkit and the

  • kaldi-offline-transcriber

    A good example for a project that handles both training and decoding. It is being build for Estonian but can be easily transformed into any language.

  • online2-tcp-nnet3-decode-faster

    A new excutable that was

  • tf-kaldi-speaker

    A framework that combines TensorFlow and Kaldi in the context of speaker verification/identification tasks. The project has some pretrained model that were trained on huge datasets.

Good resources for more complex stuff:

Available pretrained models:

Showing a sample of 33 resources. View the full list on GitHub →