Skip to main content

A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights into the safety implications, challenges, and advancements surrounding these powerful models.

1.9k
GitHub Stars
17
Curated Resources
8
Categories
5 hours ago
Last Refreshed
🤗Introduction🤔AI Safety & Security Discussions🔐Security & Discussion🔏Privacy📰Truthfulness & Misinformation😈JailBreak & Attacks🛡️Defenses & Mitigation💯Datasets & Benchmark

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me 📖tutorials, articles, presentations and talks resources from awesome-llm-safety"

Installation instructions →

What's inside

😈JailBreak & Attacks

  • 12.10📖Tutorials, Articles, Presentations and Talks

    Resource

  • 12.10📖Tutorials, Articles, Presentations and Talks

    Article

  • 23.01📖Tutorials, Articles, Presentations and Talks

    Community

  • 23.02📖Tutorials, Articles, Presentations and Talks

    Resource&Tutorials

  • 23.10📖Tutorials, Articles, Presentations and Talks

    Article

  • 23.11📖Tutorials, Articles, Presentations and Talks

    Video

🤔AI Safety & Security Discussions

  • 2024/5/20

    Managing extreme AI risks amid rapid progress

🔐Security & Discussion

  • 22.02📖Tutorials, Articles, Presentations and Talks

    Toxicity Detection API

  • 23.07📖Tutorials, Articles, Presentations and Talks

    Repository

📰Truthfulness & Misinformation

  • 23.07📖Tutorials, Articles, Presentations and Talks

    Repository

  • 23.10📖Tutorials, Articles, Presentations and Talks

    Repository

💯Datasets & Benchmark

🔏Privacy

  • 24.01📖Tutorials, Articles, Presentations and Talks

    Tutorials

🤗Introduction

Showing a sample of 17 resources. View the full list on GitHub →