awesome-code-benchmark
github.com/tongye98/awesome-code-benchmark
A comprehensive review of code-domain benchmarks for LLM research.
GitHub Stars: 226
Curated Resources: 9
Categories: 2
Last Refreshed: 21 hours ago
News, Survey
What's inside
News
- A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code
- AutoCodeBench: Large Language Models are Automatic Code Benchmark Generators
- CORE: Benchmarking LLMs Code Reasoning Capabilities through Static Analysis Tasks
- LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering
- SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?
- TRACY: Benchmarking Execution Efficiency of LLM-Based Code Translation
Showing a sample of the 9 curated resources; the full list is on GitHub.