/plugins/specweave-infrastructure/skills/observability-engineer
This skill helps you design and deploy observability pipelines with OpenTelemetry, Prometheus, and Grafana to improve monitoring, tracing, and alerting.
npx playbooks add skill anton-abyzov/specweave --skill observability-engineer
Review the files below or copy the command above to add this skill to your agents.
---
name: observability-engineer
description: Observability architect - OpenTelemetry-first, Prometheus+Grafana stack, SLIs/SLOs, alert fatigue prevention. Use for metrics, logs, traces setup.
model: opus
context: fork
---
## ⚠️ Chunking Rule
A full monitoring stack (Prometheus + Grafana + OpenTelemetry + logs) can run to 1000+ lines of code and configuration. Generate ONE component per response, in this order: Metrics → Dashboards → Alerting → Tracing → Logs.
This skill is an observability architect that designs and implements OpenTelemetry-first observability for TypeScript projects using a Prometheus + Grafana stack. It focuses on SLIs/SLOs, alert fatigue prevention, and pragmatic observability patterns that scale with production systems. Use it to produce concrete artifacts for metrics, dashboards, alerting, tracing, or logs, one component per request so that each artifact stays easy to review.
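To make the SLO and alert-fatigue focus concrete, here is a minimal sketch of a multi-window burn-rate check, a common pattern for paging only when the error budget is being consumed quickly and the problem is still ongoing. The names `WindowSample`, `burnRate`, and `shouldPage` are hypothetical, and the default threshold of 14.4 is an assumption (a value often paired with a 1-hour and a 5-minute window against a 30-day SLO). In a real setup the request counts would typically come from Prometheus queries rather than plain objects.

```typescript
// Hypothetical types and helpers for illustration; not part of any library.
interface WindowSample {
  totalRequests: number;  // all requests observed in the window
  failedRequests: number; // requests that violated the SLI in the window
}

// Fraction of requests that failed in the window (0 when there is no traffic).
function errorRatio(sample: WindowSample): number {
  if (sample.totalRequests === 0) return 0;
  return sample.failedRequests / sample.totalRequests;
}

// Burn rate = observed error ratio divided by the error budget (1 - SLO target).
// A burn rate of 1 spends the budget exactly over the SLO period.
function burnRate(sample: WindowSample, sloTarget: number): number {
  if (sloTarget <= 0 || sloTarget >= 1) {
    throw new RangeError("sloTarget must be strictly between 0 and 1");
  }
  const errorBudget = 1 - sloTarget;
  return errorRatio(sample) / errorBudget;
}

// Page only when BOTH windows exceed the threshold: the long window shows the
// burn is significant, the short window shows it is still happening.
// The 14.4 default is an assumed threshold, not a value from this skill.
function shouldPage(
  longWindow: WindowSample,
  shortWindow: WindowSample,
  sloTarget: number,
  threshold: number = 14.4
): boolean {
  return (
    burnRate(longWindow, sloTarget) >= threshold &&
    burnRate(shortWindow, sloTarget) >= threshold
  );
}
```

Requiring both windows is what reduces alert noise here: a short spike that has already recovered fails the short-window check, and a slow trickle of errors fails the long-window threshold, so neither pages anyone.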
Provide a target component (Metrics, Dashboards, Alerting, Tracing, or Logs) and the service context. The skill generates code, configuration, and recommended SLI/SLO definitions tailored to TypeScript services and common CI/CD workflows. It prioritizes OpenTelemetry instrumentation, Prometheus metrics exposition, and Grafana dashboards while including guidance to reduce alert noise.
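As an illustration of what "Prometheus metrics exposition" means in practice, the sketch below renders a counter in the Prometheus text exposition format using plain TypeScript with no dependencies. The `Counter` class and its `inc` and `expose` methods are hypothetical names written for this example, not the API of any metrics library; a production service would more likely use an established client library or the OpenTelemetry SDK, and this is not necessarily the code the skill itself generates.

```typescript
// Hypothetical, dependency-free counter for illustration only.
type Labels = Record<string, string>;

// Escape backslash, double quote, and newline in label values,
// as the text exposition format requires.
function escapeLabelValue(value: string): string {
  return value.replace(/\\/g, "\\\\").replace(/"/g, '\\"').replace(/\n/g, "\\n");
}

// Render labels as {a="1",b="2"} with keys sorted so that the same label set
// always maps to the same series, regardless of property order.
function formatLabels(labels: Labels): string {
  const pairs = Object.entries(labels)
    .sort(([a], [b]) => (a < b ? -1 : a > b ? 1 : 0))
    .map(([key, value]) => `${key}="${escapeLabelValue(value)}"`);
  return pairs.length === 0 ? "" : `{${pairs.join(",")}}`;
}

class Counter {
  private readonly name: string;
  private readonly help: string;
  // Maps the rendered label string to the current series value.
  private readonly series = new Map<string, number>();

  constructor(name: string, help: string) {
    this.name = name;
    this.help = help;
  }

  // Counters are monotonic, so negative increments are rejected.
  inc(labels: Labels = {}, amount: number = 1): void {
    if (amount < 0) throw new RangeError("counter increments must be non-negative");
    const key = formatLabels(labels);
    this.series.set(key, (this.series.get(key) ?? 0) + amount);
  }

  // Produce the text a /metrics endpoint would return for this counter.
  // Simplification: escaping of the HELP text is omitted here.
  expose(): string {
    const lines = [
      `# HELP ${this.name} ${this.help}`,
      `# TYPE ${this.name} counter`,
    ];
    this.series.forEach((value, key) => {
      lines.push(`${this.name}${key} ${value}`);
    });
    return lines.join("\n") + "\n";
  }
}
```

A service would call something like `requests.inc({ method: "GET", status: "200" })` in its request handler and return `requests.expose()` from the endpoint that Prometheus scrapes. Keeping label values to a small, bounded set (status codes and route names rather than user IDs) keeps the number of series manageable.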
Can I ask for multiple components at once?
No. To keep output manageable and reviewable, request one component per response—Metrics, Dashboards, Alerting, Tracing, or Logs.
Does it produce runnable config and code?
Yes. Outputs include TypeScript snippets, Prometheus scrape/recording rules, Grafana JSON models, and OpenTelemetry config designed to be integrated into CI/CD.