A curated list of Site Reliability and Production Engineering Tools

1.5k GitHub Stars
291 Curated Resources
9 Categories
Last Refreshed: 5 hours ago
  • Development
  • Continuous Testing
  • Continuous Integration
  • Continuous Delivery
  • Continuous Monitoring
  • Incident Management / Incident Response / IT Alerting / On-Call
  • Internal Developer Portal
  • AI SRE Tools & SRE Copilots
  • Related Lists

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me integration resources from awesome-sre-tools"

Installation instructions →

What's inside

Continuous Integration

Continuous Monitoring

  • agenttrace

    TUI observability for AI coding agents. Track cost, tokens, tool failures, latency, anomalies, health, diffs, and CI gates across Claude Code, Codex CLI, Gemini CLI, Aider, and Cursor exports.

  • API Status Check

    Real-time status monitoring dashboard for 250+ developer APIs including AWS, Stripe, GitHub, and OpenAI. Free, no signup required.

  • API Status Check

    Centralized dashboard tracking real-time status and outages for 1,000+ popular APIs and services (AWS, Stripe, GitHub, Twilio, etc.). Monitor third-party dependencies, get instant outage alerts, reduce MTTR.

  • Apitally

    API monitoring, analytics, and request logging for REST APIs, with lightweight open-source SDKs for Python, Node.js, Go, .NET, and Java.

  • AppSignal

  • AWS CloudWatch

Incident Management / Incident Response / IT Alerting / On-Call

Continuous Delivery

Development

  • Asana

    Project Management & Issue Tracking Software

  • Atom

    Code Editors and IDEs

  • Azure Boards

    Project Management & Issue Tracking Software

  • Basecamp

    Project Management & Issue Tracking Software

  • Bitbucket

    Source Code Management

  • Bitbucket Issues

    Project Management & Issue Tracking Software

Related Lists

Showing a sample of 291 resources. View the full list on GitHub →