DevOps Runbooks Skill
Creates actionable runbooks for operational procedures, incident response, and system maintenance.
What This Skill Does
- Creates operational runbooks
- Documents incident procedures
- Defines escalation paths
- Provides troubleshooting guides
- Documents rollback procedures
- Captures operational knowledge
When to Use
- Incident response planning
- On-call documentation
- System maintenance procedures
- Disaster recovery planning
- Knowledge transfer
Reference Files
- references/RUNBOOK.template.md - Comprehensive operational runbook format
Runbook Structure
- Overview - Purpose and when to use
- Prerequisites - Access and tools needed
- Quick Reference - Key commands and URLs
- Procedure - Step-by-step with verification
- Rollback - How to revert changes
- Troubleshooting - Common issues
- Escalation - When and how to escalate
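The structure above can be checked mechanically before a runbook is published. The following is a minimal sketch, not part of the skill itself: the function name `missing_sections` is hypothetical, and it assumes each section appears as a Markdown-style heading (for example `## Rollback`), which may not match the layout of references/RUNBOOK.template.md.

```python
import re

# Section names come from the structure list above. The "#"-style heading
# format is an assumption about how a runbook file marks its sections.
REQUIRED_SECTIONS = [
    "Overview",
    "Prerequisites",
    "Quick Reference",
    "Procedure",
    "Rollback",
    "Troubleshooting",
    "Escalation",
]


def missing_sections(runbook_text: str) -> list[str]:
    """Return the required section names that have no matching heading."""
    headings = {
        match.group(1).strip().lower()
        for match in re.finditer(r"^#{1,6}\s+(.+?)\s*$", runbook_text, re.MULTILINE)
    }
    return [name for name in REQUIRED_SECTIONS if name.lower() not in headings]


if __name__ == "__main__":
    draft = "## Overview\n## Prerequisites\n## Procedure\n"
    # Lists the sections the draft still needs, in structure order.
    print(missing_sections(draft))
```

A check like this could run in CI or a pre-commit hook so that a runbook without a Rollback or Escalation section is caught before an incident, not during one.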
Best Practices
- Commands must be copy-pasteable
- Include expected output for each step
- Document decision points clearly
- Define a rollback for each step
- Keep procedures current (test regularly)
- Include escalation contacts
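Several of these practices (a copy-pasteable command, expected output, and a rollback per step) can be captured in a small data model. The sketch below is illustrative only: `Step`, `run_command`, and `execute` are hypothetical names, and matching expected output by substring is an assumption about what counts as a passing verification.

```python
import shlex
import subprocess
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Step:
    description: str
    command: str                    # copy-pasteable, exactly as documented
    expected: str                   # substring expected in the command output
    rollback: Optional[str] = None  # command that reverts this step, if any


def run_command(command: str) -> str:
    """Run a command without a shell and return its standard output."""
    result = subprocess.run(
        shlex.split(command), capture_output=True, text=True, check=False
    )
    return result.stdout


def execute(steps: list[Step], run: Callable[[str], str] = run_command) -> bool:
    """Run steps in order; if a check fails, roll back in reverse order."""
    attempted: list[Step] = []
    for step in steps:
        # The failed step is rolled back too, since it may have partially applied.
        attempted.append(step)
        output = run(step.command)
        if step.expected not in output:
            print(f"FAILED: {step.description}; rolling back")
            for previous in reversed(attempted):
                if previous.rollback:
                    run(previous.rollback)
            return False
        print(f"OK: {step.description}")
    return True
```

Passing `run` as a parameter lets the procedure be rehearsed against a fake runner, which is one way to test a runbook regularly without touching production systems.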