Skip to main content

A curated list of ML awesome frameworks & libraries for text data

17
GitHub Stars
35
Curated Resources
3
Categories
17 hours ago
Last Refreshed
Frameworks and librariesKnowledge 📚No longer maintained

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me :snake: python resources from awesome-text-ml"

Installation instructions →

What's inside

No longer maintained

  • artificial-adversary

    Tool to generate adversarial text examples and test machine learning models against them.

  • DELTA

    DELTA is a deep learning based natural language and speech processing platform.

  • EventForecast

    Time series prediction and text analysis using Keras LSTM, plus clustering, association rules mining.

  • lazynlp

    Library to scrape and clean web pages to create massive datasets.

  • MeTA: ModErn Text Analysis

    A Modern C++ Data Sciences Toolkit.

  • NeuronBlocks

    NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego.

Frameworks and libraries

  • AugLy:snake: Python

    A data augmentations library from Facebook research for audio, image, text, and video.

  • bert-as-service:snake: Python

    Mapping a variable-length sentence to a fixed-length vector using BERT model.

  • BIG-bench:snake: Python

    Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models.

  • dedupe:snake: Python

    A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

  • DeepPavlov:snake: Python

    An open source library for deep learning end-to-end dialog systems and chatbots.

  • fairseq:snake: Python

    Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Knowledge 📚

  • Awesome Sentiment AnalysisMultiple languages

    Repository with all what is necessary for sentiment analysis and related areas

  • nlp-recipesPython (and Python Notebooks)

    Comprehensive set of tools and examples that leverage recent advances in NLP algorithms, neural architectures, and distributed machine learning systems.

  • practicalAIPython (and Python Notebooks)

    A practical approach to machine learning to enable everyone to learn, explore and build.

  • VirgilioLearning 101

    Virgilio is an open-source initiative, aiming to mentor and guide anyone in the world of the Data Science.

Showing a sample of 35 resources. View the full list on GitHub →