A list of awesome NLP resources for the Italian language.

GitHub Stars: 1
Curated Resources: 30
Categories: 3
Last Refreshed: 20 hours ago

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me sentiment analysis resources from awesome-italian"

Installation instructions →

What's inside

Corpora

  • Absita2018 (Sentiment Analysis)

    Dataset crawled from Booking for the Evalita Absita competition, 2018 edition.

  • Distributional Polarity Lexicon (Sentiment Analysis)

    Annotated dataset of sentiment polarity for short (i.e. few words) expressions.

  • Europarl (Parallel corpora)

    Parallel Italian-English sentences from the European Parliament.

  • Happy Parents (Sentiment Analysis)

    Annotated datasets of parent-to-parent and parent-to-child dialogues.

  • HaSpeeDe (Hate speech recognition)

    Dataset for the Evalita Hate Speech Detection competition, 2018 and 2020 editions.

  • I-CAB (Named Entity Recognition)

    Corpora of annotated articles from "L'Adige" for NER tasks.

Models

  • Feel-IT (Sentiment Analysis)

    A BERT-based sentiment and emotion classifier for Italian.

  • multilang-summarizer (Text summarization)

    A multilingual text summarization model partially supported by the National Council of Science and Technology (CONACYT) of Mexico.

  • SentITA (Sentiment Analysis)

    A bidirectional LSTM-CNN that operates at the word level for sentiment polarity classification.

  • UmBERTo (Language Models)

    A RoBERTa-based language model trained on large Italian corpora.

Useful libraries

  • italian-dictionary (Only Italian)

    A Python library to retrieve the meaning of Italian lemmas.

  • NLTK (Multilingual, also supporting Italian)

    The Natural Language Toolkit library.

  • spaCy (Multilingual, also supporting Italian)

    A general-purpose Python NLP library.

Showing a sample of 30 resources. View the full list on GitHub →