Skip to main content

This repository collects all relevant resources about interpretability in LLMs

401
GitHub Stars
239
Curated Resources
8
Categories
5 hours ago
Last Refreshed
Survey PapersPosition PapersInterpretable Analysis of LLMsSAE, Dictionary Learning and SuperpositionInterpretability in Vision LLMsBenchmarking InterpretabilityEnhancing InterpretabilityOthers

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me resources resources from awesome-interpretability-in-large-language-models"

Installation instructions →

What's inside

Showing a sample of 239 resources. View the full list on GitHub →