awesome-llms-evaluation-papers

The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.

804

GitHub Stars

278

Curated Resources

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me :triangular_ruler:alignment evaluation resources from awesome-llms-evaluation-papers"

Showing a sample of 278 resources. View the full list on GitHub →