awesome-linguistics
github.com/theimpossibleastronaut/awesome-linguistics
A curated list of anything remotely related to linguistics.
What's inside
Lists
Data sets
- Araneum Germanicum
- CEHugeWebCorpus
German corpus based on CommonCrawl
- C-WEP
- Digitales Wörterbuch der deutschen Sprache (DWDS)
- DysList (list of dyslexic errors)
- EuroRomCom Data
JSON-formatted Pan-Romance word lists.
On Wikipedia
Platforms and toolkits
- CLARIN-D web tools
Tools for Analysing Research Data
- CorpusExplorer
Software for corpus linguists and text/data mining enthusiasts. The CorpusExplorer combines over 50 interactive visualizations under a user-friendly interface.
- Haxe-linguistics
Early linguistic analysis and natural language processing library for Haxe.
- Mate Tools
- Natural
General natural language tools for Node.js.
- Natural Language ToolKit (NLTK)
A leading platform for building Python programs to work with human language data.
On YouTube
- Computational Linguistics Lecture Playlist (YouTube)
Lectures for a University of Maryland class on computational linguistics.
- The Virtual Linguistics Campus
CC-licensed educational videos interconnected with Marburg University's e-learning platform of the same name.
Deep learning models and transformers
Standards
Books
- Essentials of Linguistics, 2nd edition
An introductory book.
- Foundations of Computational Linguistics
- Foundations of Statistical Natural Language Processing
- Introduction to Linguistics
- Natural Language Processing with Python
The book that accompanies the NLTK package.
- Semisupervised Learning for Computational Linguistics
Showing a sample of 79 resources. The full list is on GitHub.