Skip to main content

A repo to keep all resources about interpretability in NLP organised and up to date

13
GitHub Stars
25
Curated Resources
9
Categories
19 hours ago
Last Refreshed
Social SciencesFact CheckingMachine Reaching Comprehension / Question AnsweringSaliency mapsGenerating RationalesOtherOn Human RationalesDatasets with HighlightsDatasets with Textual explanations

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me other resources from awesome-text-interpretability"

Installation instructions →

What's inside

Other

On Human Rationales

  • Evaluating and Characterizing Human Rationales, EMNLP 2020

    An open question, however, is how human rationales fare with these automatic metrics - do not necessarily perform well- reveal irrelevance and redundancy. Our work leads to actionable suggestions for evaluating and characterizing rationales.

  • From Language to Language-ish: How Brain-Like is an LSTM's Representation of Nonsensical Language Stimuli?, EMNLP 2020

    The syntactic signatures available in Sentence and Jabberwocky LSTM representations are similar, and can be predicted from either the Sentence or Jabberwocky EEG. From our results, we can infer which LSTM representations encode semantic and/or syntactic information. We confirm using syntactic and semantic probing tasks. Our results show that there are similarities between the way the brain and an LSTM represent stimuli from both the Sentence (within-distribution) and Jabberwocky (out-of-distribution) conditions.

Fact Checking

Generating Rationales

Saliency maps

Machine Reaching Comprehension / Question Answering

Showing a sample of 25 resources. View the full list on GitHub →