Context Awesome

awesome-rlhf

github.com/opendilab/awesome-rlhf ↗

A curated list of reinforcement learning with human feedback resources (continually updated)

4.4k

GitHub Stars

360

Curated Resources

5

Categories

3 hours ago

Last Refreshed

PapersCodebasesDatasetBlogsBooks

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me 2023 resources from awesome-rlhf"

Installation instructions →

What's inside

Papers

Blogs

A Survey of Reinforcement Learning from Human Feedback (RLHF)
ChatGPT: Optimizing Language Models for Dialogue
Guide to Reinforcement Learning from Human Feedback (RLHF) for Computer Vision
Illustrating Reinforcement Learning from Human Feedback (RLHF)
John Schulman - Reinforcement Learning from Human Feedback: Progress and Challenges
Reinforcement Learning from Human Feedback: Progress and Challenges
Learning through human feedback

Dataset

Codebases

Showing a sample of 360 resources. View the full list on GitHub →