awesome-kurdish
github.com/happyhackingspace/awesome-kurdish ↗A curated list of resources about the Kurdish language, culture, science, and technology.
Use this list with your AI agent
Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:
"Show me language learning resources from awesome-kurdish"
Installation instructions →What's inside
Language Resources
- 50LanguagesLanguage Learning
A free website used for learning Kurdish.
- A Dataset for the Classification of Different Kurdish DialectsDatasets
A 6,000-sample dataset for Kurdish dialect recognition.
- A Dataset for the Classification of Different Kurdish DialectsDatasets
A dataset of 6,000 one-second audio samples for Kurdish dialect recognition.
- Data.krdDatasets
DataKrd is dedicated to making Kurdish datasets accessible to all.
- FerhengcoDictionaries and Corpora
Ferheng.co is an Kurdish (Kurmanji) - Turkish Dictionary.
- Ferheng KurdiDictionaries and Corpora
Ferheng Kurdi can be used as a platform for translating Kurdish (Kurmanji) into various languages
Natural Language Processing
- AI2001_Category-Linguistics-SC-KurdishLibraries and Tools
linguistic:Kurdish category for AI2001, containing Kurdish language linguistic datasets.
- AsoSoft-TTS-Speech-Corpus-for-Central-KurdishText-to-Speech
Speech data in the Kurdish language and associated resources such as tags are among the most essential language resources needed for NLP research and applications such as speech synthesis and automatic speech recognition, etc. Speech data for Central Kurdish was created and gathered to use in speech synthesis and automatic speech recognition as part of this project. In order to create this corpus, 21 hours of speech have been recorded and transcribed.
- Character ConvertorLibraries and Tools
Kurdish Language Library for converting characters and digits in Persian, English and Arabic to Kurdish and vice versa.
- ElevenLabs (Kurdish)Speech Recognition
Free Central Kurdish speech to text using our advanced AI transcription tool, Scribe. Transcribe Central Kurdish voice, audio, and speech with industry-leading accuracy—Scribe outperforms Google Gemini and OpenAI Whisper, delivering a word error rate of just 3.1% on the FLEURS benchmark and 5.5% on Common Voice. Get accurate Central Kurdish transcriptions for films, podcasts, business meetings, medical dictation, and more.
- KURD.AISpeech Recognition
A comprehensive platform offering AI services chat, image, text-to-voice, and more for diverse needs.
- kurdiLibraries and Tools
Various Kurdi related work done by Kurdish developers.
Academic Research
- A Kurdish Sorani Twitter dataset for language modelling
This paper presents a Kurdish sentiment analysis dataset of 24,668 labeled tweets.
- Dataset for the recognition of Kurdish sound dialects
This paper presents a Kurdish dialect recognition dataset for improving speech recognition systems.
- Iraqi Legal Gpt
This paper introduces a small AI chatbot for offline legal information in Iraqi law with 80% accuracy.
- KLPT – Kurdish Language Processing Toolkit
This paper introduces a Kurdish language processing toolkit to address the lack of basic tools for this under-resourced language, which includes components like tokenization, stemming, and lemmatization. The toolkit is extendable by future developers and is publicly available.
- KuBERT: Central Kurdish BERT Model and Its Application for Sentiment Analysis
This paper enhances Kurdish sentiment analysis using BERT, achieving better accuracy than traditional models.
- Kurdish News Dataset Headlines (KNDH) through multiclass classification
KNDH is a dataset of 50,000 Kurdish news headlines across five categories for text classification.
Programming Resources
- Flutter Kurdish LocalizationLocalization Projects
Localization support for Central Kurdish Branch Sorani (Kurdish: سۆرانی ,Soranî)
Showing a sample of 48 resources. View the full list on GitHub →