Skip to main content

The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.

803
GitHub Stars
278
Curated Resources
2
Categories
6 hours ago
Last Refreshed
Related Surveys for LLMs EvaluationPapers

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me :triangular_ruler:alignment evaluation resources from awesome-llms-evaluation-papers"

Installation instructions →

What's inside

Papers

  • Github:triangular_ruler:Alignment Evaluation

  • GitHub:earth_americas:Evaluation Organization

  • GitHub:triangular_ruler:Alignment Evaluation

    a Pragmatic Alignment Evaluation"

  • GitHub:earth_americas:Evaluation Organization

  • Paper:books:Knowledge and Capability Evaluation

  • Paper:books:Knowledge and Capability Evaluation

Related Surveys for LLMs Evaluation

Showing a sample of 278 resources. View the full list on GitHub →