awesome-data-contamination
github.com/lyy1994/awesome-data-contamination ↗The Paper List on Data Contamination for Large Language Models Evaluation.
115
GitHub Stars
132
Curated Resources
2
Categories
3 hours ago
Last Refreshed
📜 Papers🧰 Resources
Use this list with your AI agent
Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:
"Show me 🎯 the list resources from awesome-data-contamination"
Installation instructions →What's inside
🧰 Resources
- Language Model Evaluation Harness🛠️ Tools
- LLM Decontaminator🛠️ Tools
- MIMIR📊 Datasets
- mock_gsm8k_test📊 Datasets
- PatentMIA📊 Datasets
- WikiMIA📊 Datasets
Showing a sample of 132 resources. View the full list on GitHub →