awesome-text-ml
github.com/oskar-j/awesome-text-ml ↗A curated list of ML awesome frameworks & libraries for text data
Use this list with your AI agent
Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:
"Show me :snake: python resources from awesome-text-ml"
Installation instructions →What's inside
No longer maintained
- artificial-adversary
Tool to generate adversarial text examples and test machine learning models against them.
- DELTA
DELTA is a deep learning based natural language and speech processing platform.
- EventForecast
Time series prediction and text analysis using Keras LSTM, plus clustering, association rules mining.
- lazynlp
Library to scrape and clean web pages to create massive datasets.
- MeTA: ModErn Text Analysis
A Modern C++ Data Sciences Toolkit.
- NeuronBlocks
NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego.
Frameworks and libraries
- AugLy:snake: Python
A data augmentations library from Facebook research for audio, image, text, and video.
- bert-as-service:snake: Python
Mapping a variable-length sentence to a fixed-length vector using BERT model.
- BIG-bench:snake: Python
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models.
- dedupe:snake: Python
A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
- DeepPavlov:snake: Python
An open source library for deep learning end-to-end dialog systems and chatbots.
- fairseq:snake: Python
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Knowledge 📚
- Awesome Sentiment AnalysisMultiple languages
Repository with all what is necessary for sentiment analysis and related areas
- nlp-recipesPython (and Python Notebooks)
Comprehensive set of tools and examples that leverage recent advances in NLP algorithms, neural architectures, and distributed machine learning systems.
- practicalAIPython (and Python Notebooks)
A practical approach to machine learning to enable everyone to learn, explore and build.
- VirgilioLearning 101
Virgilio is an open-source initiative, aiming to mentor and guide anyone in the world of the Data Science.
Showing a sample of 35 resources. View the full list on GitHub →