[TMLR] A curated list of language modeling research for code (and other software engineering activities), plus related datasets.

GitHub Stars: 3.4k
Curated Resources: 2.3k
Categories: 10
Last Refreshed: 53 min ago
  • News
  • 1. Surveys
  • 2. Models
  • 3. When Coding Meets Reasoning
  • 4. Code LLM for Low-Resource, Low-Level, and Domain-Specific Languages
  • 5. Methods/Models for Downstream Tasks
  • 6. Analysis of AI-Generated Code
  • 7. Human-LLM Interaction
  • 8. Datasets
  • Other Awesome LLM Reading Lists

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me 2.1 base llms and pretraining strategies resources from awesome-code-llm"

Installation instructions →

What's inside

2. Models

  • blog: 2.1 Base LLMs and Pretraining Strategies

  • blog: 2.3 General Pretraining on Code

8. Datasets

5. Methods/Models for Downstream Tasks

4. Code LLM for Low-Resource, Low-Level, and Domain-Specific Languages

3. When Coding Meets Reasoning

  • paper: 3.5 Frontend Navigation

Showing a sample of 2.3k resources. View the full list on GitHub →