text_mining_resources
github.com/stepthom/text_mining_resources
Resources for learning about Text Mining and Natural Language Processing
597 GitHub Stars
522 Curated Resources
12 Categories
Last Refreshed: 5 hours ago
- Books
- Blogs
- Blog Articles, Papers, Case Studies
- Major NLP Conferences
- Benchmarks
- Online courses
- APIs and Libraries
- Products
- Online Demos and Tools
- Datasets
- Misc
- Other Curated Lists
What's inside
Blog Articles, Papers, Case Studies
- 100 Must-Read NLP Papers (General)
- 5 Heroic Tools for Natural Language Processing (General)
- A comparison of Lexicon-based approaches for Sentiment Analysis of microblog posts (Sentiment Analysis)
- A Deeper Look into Sarcastic Tweets Using Deep Convolutional Neural Networks (Sarcasm Detection)
- agrep method in R (Fuzzy Matching, Probabilistic Matching, Record Linkage, Etc.)
- A Guide to Building a Multi-Featured Slackbot with Python - March 2017 (Q&A Systems, Chatbots)
Datasets
- 15 Best Chatbot Datasets for Machine Learning
- 200,000 Russian Troll Tweets
Released by Congress from Twitter suspended accounts and removed from public view.
- AFINN (Lexicons for Sentiment Analysis)
- Amazon product data
- American National Corpus Download
- A Survey of Available Corpora for Building Data-Driven Dialogue Systems
Misc
- A Complete Exploratory Data Analysis and Visualization for Text Data
- AskReddit: People with a mother tongue that isn't English, what are the most annoying things about the English language when you are trying to learn it?
- Detecting Gang-Involved Escalation on Social Media Using Context
- Funny Video: Emotional Spell Check
Products
- Alchemy API
- Amazon Comprehend
- Amazon Lex
- Anafora
- Annotation Lab
Free End-to-End No-Code platform for text annotation and DL model training/tuning. Out-of-the-box support for Named Entity Recognition, Classification, Relation extraction and Assertion Status Spark NLP models. Unlimited support for users, teams, projects, documents.
- Apache PDFBox
APIs and Libraries
Online Demos and Tools
- AllenNLP Demo
- Another word2vec demo
- Cognitive Computation Group - Part of Speech Tagging Demo
Part of Speech Tagging Demo
Books
- An introduction for information retrieval
- Applied Natural Language Processing With Python
- Applied Text Analysis with Python: Enabling Language-Aware Data Products with Machine Learning
- Blueprints for Text Analytics Using Python: Machine Learning-Based Solutions for Common Real World (NLP) Applications
- Deep Learning with Text
- Foundations of Computational Linguistics: Human-Computer Communication in Natural Language
Showing a sample of 522 resources. View the full list on GitHub.