awesome-llm-safety
github.com/ydyjya/awesome-llm-safety ↗A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights into the safety implications, challenges, and advancements surrounding these powerful models.
1.9k
GitHub Stars
17
Curated Resources
8
Categories
5 hours ago
Last Refreshed
🤗Introduction🤔AI Safety & Security Discussions🔐Security & Discussion🔏Privacy📰Truthfulness & Misinformation😈JailBreak & Attacks🛡️Defenses & Mitigation💯Datasets & Benchmark
Use this list with your AI agent
Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:
"Show me 📖tutorials, articles, presentations and talks resources from awesome-llm-safety"
Installation instructions →What's inside
😈JailBreak & Attacks
- 12.10📖Tutorials, Articles, Presentations and Talks
Resource
- 12.10📖Tutorials, Articles, Presentations and Talks
Article
- 23.01📖Tutorials, Articles, Presentations and Talks
Community
- 23.02📖Tutorials, Articles, Presentations and Talks
Resource&Tutorials
- 23.10📖Tutorials, Articles, Presentations and Talks
Article
- 23.11📖Tutorials, Articles, Presentations and Talks
Video
🤔AI Safety & Security Discussions
- 2024/5/20
Managing extreme AI risks amid rapid progress
🔐Security & Discussion
📰Truthfulness & Misinformation
💯Datasets & Benchmark
- 23.10📖Tutorials, Articles, Presentations and Talks
Tutorials
- RealToxicityPrompts datasets📚Resource📚
- TruthfulQA datasets📚Resource📚
🔏Privacy
- 24.01📖Tutorials, Articles, Presentations and Talks
Tutorials
🤗Introduction
Showing a sample of 17 resources. View the full list on GitHub →