awesome-llm-safety

github.com/ydyjya/awesome-llm-safety ↗

A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights into the safety implications, challenges, and advancements surrounding these powerful models.

1.9k

GitHub Stars

Curated Resources

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me 📖tutorials, articles, presentations and talks resources from awesome-llm-safety"

Installation instructions →

What's inside

😈JailBreak & Attacks

12.10📖Tutorials, Articles, Presentations and Talks
Article
12.10📖Tutorials, Articles, Presentations and Talks
Resource
23.01📖Tutorials, Articles, Presentations and Talks
Community
23.02📖Tutorials, Articles, Presentations and Talks
Resource&Tutorials
23.10📖Tutorials, Articles, Presentations and Talks
Article
23.11📖Tutorials, Articles, Presentations and Talks
Video

🤔AI Safety & Security Discussions

2024/5/20
Managing extreme AI risks amid rapid progress

🔐Security & Discussion

22.02📖Tutorials, Articles, Presentations and Talks
Toxicity Detection API
23.07📖Tutorials, Articles, Presentations and Talks
Repository

📰Truthfulness & Misinformation

23.07📖Tutorials, Articles, Presentations and Talks
Repository
23.10📖Tutorials, Articles, Presentations and Talks
Repository

💯Datasets & Benchmark

23.10📖Tutorials, Articles, Presentations and Talks
Tutorials
RealToxicityPrompts datasets📚Resource📚
TruthfulQA datasets📚Resource📚

🔏Privacy

24.01📖Tutorials, Articles, Presentations and Talks
Tutorials

🤗Introduction

zhrli324

Showing a sample of 17 resources. View the full list on GitHub →