Skip to main content

An index of algorithms for reinforcement learning from human feedback (rlhf))

91
GitHub Stars
104
Curated Resources
3
Categories
3 hours ago
Last Refreshed
PapersBlogs/Talks/ReportsOpen Source Software/Implementations

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me rlhf for llms: theory / methods resources from awesome-rlhf"

Installation instructions →

What's inside

Showing a sample of 104 resources. View the full list on GitHub →