home / skills / gtmagents / gtm-agents / moderation-safety-playbook
/plugins/community-building/skills/moderation-safety-playbook
This skill helps organizations implement moderation safety playbooks with policy matrices, workflows, and escalation guidance for scalable risk management.
npx playbooks add skill gtmagents/gtm-agents --skill moderation-safety-playbookReview the files below or copy the command above to add this skill to your agents.
---
name: moderation-safety-playbook
description: Guidelines and workflows for community moderation, trust & safety, and
escalation.
---
# Moderation & Safety Playbook Skill
## When to Use
- Launching or scaling moderated forums, chats, or live events.
- Training moderators, ambassadors, or vendors on policies.
- Handling escalations, abuse reports, or crisis communications.
## Framework
1. **Policy Matrix** – code of conduct, content types, enforcement tiers, regional considerations.
2. **Workflow** – intake → triage → resolution → follow-up, with SLAs and ownership.
3. **Tooling** – moderation dashboards, keyword lists, automation rules, reporting forms.
4. **Escalation Ladder** – when to engage legal, security, PR, or executive sponsors.
5. **Post-Incident Review** – retrospectives, comms templates, improvements.
## Templates
- Moderator guide with sample responses, tone guidance, and decision trees.
- Incident log + scoreboard for tracking volume, severity, MTTR.
- Crisis comms outline for public/partner updates.
## Tips
- Publish a transparent policy summary so members know expectations.
- Rotate moderators to reduce burnout; provide mental health resources.
- Pair with `launch-community-activation-series` for real-time event safety coverage.
---
This skill provides a production-ready moderation and safety playbook for community platforms, live events, and chat environments. It bundles clear frameworks, actionable workflows, and ready-to-use templates to help teams scale trust & safety operations while keeping response times and risk controlled.
The playbook defines a Policy Matrix that maps content types to enforcement tiers and regional considerations. It prescribes an end-to-end Workflow (intake → triage → resolution → follow-up) with SLAs, ownership, and tooling recommendations. Included templates cover moderator guidance, incident logging, and crisis communications, plus an Escalation Ladder and post-incident review cadence.
How quickly should incidents be triaged?
Set triage SLAs based on severity: immediate (minutes) for safety risks, same-day for harmful content, and 48–72 hours for low-priority reports.
When should legal or PR be engaged?
Escalate to legal for threats, doxxing, or subpoena risks; involve PR for public-facing incidents or where partner reputation is affected.