home / skills / pluginagentmarketplace / custom-plugin-typescript / devops

devops skill

/skills/devops

This skill helps you deploy and manage cloud infrastructure and CI/CD pipelines with containerization, Kubernetes, and IaC best practices.

npx playbooks add skill pluginagentmarketplace/custom-plugin-typescript --skill devops

Review the files below or copy the command above to add this skill to your agents.

Files (4)
SKILL.md
1.9 KB
---
name: devops-cloud
description: Master DevOps, cloud infrastructure, containerization, CI/CD, Kubernetes, and infrastructure as code. Use when deploying applications, setting up infrastructure, or managing cloud services.
sasmp_version: "1.3.0"
bonded_agent: 03-devops-cloud-infrastructure
bond_type: PRIMARY_BOND
---

# DevOps & Cloud Infrastructure Skill

## Quick Start - Docker

```dockerfile
FROM node:18-alpine

WORKDIR /app

COPY package*.json ./
RUN npm ci --only=production

COPY . .

EXPOSE 3000

CMD ["node", "server.js"]
```

```bash
# Build image
docker build -t myapp:1.0 .

# Run container
docker run -p 3000:3000 myapp:1.0
```

## Core Technologies

### Containerization
- Docker (images, containers, compose)
- Container registries
- Multi-stage builds
- Container security

### Orchestration
- Kubernetes (K8s)
- Helm package management
- Operators and controllers
- GitOps (ArgoCD, Flux)

### Cloud Platforms
- **AWS**: EC2, S3, RDS, Lambda, ECS, EKS
- **GCP**: Compute Engine, Cloud Run, Dataflow
- **Azure**: VMs, App Service, AKS

### CI/CD
- GitHub Actions
- GitLab CI/CD
- Jenkins
- CircleCI

### Infrastructure as Code
- Terraform
- CloudFormation
- Ansible
- Pulumi

### Monitoring
- Prometheus + Grafana
- ELK Stack (Elasticsearch, Logstash, Kibana)
- DataDog, New Relic
- CloudWatch

## Best Practices

1. **Automation** - Automate everything
2. **Infrastructure as Code** - Version control infrastructure
3. **Monitoring** - Comprehensive observability
4. **Security** - Defense in depth
5. **Documentation** - Keep runbooks current
6. **Testing** - Test infrastructure changes
7. **Versioning** - Version all configurations
8. **Disaster Recovery** - Regular DR testing

## Resources

- [Kubernetes Documentation](https://kubernetes.io/docs/)
- [Docker Documentation](https://docs.docker.com/)
- [Terraform Documentation](https://www.terraform.io/docs)
- [AWS Documentation](https://docs.aws.amazon.com/)

Overview

This skill helps engineers and teams master DevOps and cloud infrastructure practices for reliable application delivery. It covers containerization, orchestration, CI/CD, and infrastructure as code so you can deploy, scale, and operate services across major cloud providers. Use it to design pipelines, automate deployments, and manage cloud resources with repeatable patterns and tooling.

How this skill works

The skill inspects common deployment patterns and recommends concrete configurations for containers, CI/CD pipelines, and infrastructure as code. It provides practical examples such as Dockerfiles, CI steps, Terraform patterns, and Kubernetes workload configurations. It also highlights operational controls: monitoring, security hardening, and disaster recovery practices for production systems.

When to use it

  • Building container images and defining runtime behavior for apps
  • Designing and implementing CI/CD pipelines for automated delivery
  • Provisioning cloud infrastructure with Terraform, CloudFormation, or Pulumi
  • Deploying and operating workloads on Kubernetes with Helm or GitOps
  • Setting up observability, logging, and incident runbooks

Best practices

  • Automate everything: builds, tests, deployments, and rollbacks
  • Treat infrastructure as code and store configurations in version control
  • Use multi-stage Docker builds to minimize image size and surface area
  • Implement comprehensive observability (metrics, logs, traces) before incidents
  • Apply defense-in-depth: least privilege, secrets management, and image scanning

Example use cases

  • Create a production-ready Docker image and run it locally for testing
  • Implement GitHub Actions or GitLab CI pipeline to run tests and deploy to staging
  • Provision networking, databases, and compute on AWS using Terraform modules
  • Deploy microservices to EKS/AKS/GKE with Helm charts and ArgoCD for GitOps
  • Add Prometheus metrics and Grafana dashboards for service-level SLOs

FAQ

Which cloud provider should I choose?

Choose the provider that best aligns with your existing tooling, compliance needs, and team expertise; design workloads to be cloud-agnostic where possible using Terraform or container portability.

How do I secure container images and registries?

Use signed images, scan for vulnerabilities in CI, restrict registry access with IAM, and run containers with least privilege and read-only filesystems.