home / skills / git-fg / thecattoolkit / processing-media
This skill analyzes and edits video using ffmpeg, translating natural language commands into edits while ensuring quality.
npx playbooks add skill git-fg/thecattoolkit --skill processing-mediaReview the files below or copy the command above to add this skill to your agents.
---
name: processing-media
description: "Handles video editing, ffmpeg processing, and visual analysis. Use when transforming raw footage into polished output or analyzing visual content."
allowed-tools: [Read, Write, Edit, Glob, Bash(ffmpeg:*), Bash(ffprobe:*)]
---
# Video Editing Protocol
## Standards & Styles
### 1. Cinematic Style
- **Color:** High contrast, teal/orange grading, 24fps.
- **Motion:** Smooth stabilization, slow motion (60fps -> 24fps).
- **Audio:** Music-driven cuts, immersive sound design.
### 2. Vlog / Social Style
- **Color:** Bright, saturated, natural skin tones.
- **Motion:** Fast cuts, jump cuts, handheld feel.
- **Audio:** Clear dialogue, background music ducking.
### 3. Corporate / Explainers
- **Color:** Clean, neutral, branded palette.
- **Motion:** Static shots, smooth transitions, screen recordings.
- **Audio:** Voiceover-dominant.
## Workflow
1. **Analyze**: Use processing-media skill to understand video content (specs, visual content).
2. **Translate**: Convert natural language command to edit parameters (EDL/FFMPEG).
3. **Edit**: Apply changes using ffmpeg.
4. **Verify**: Validate output quality programmatically (Self-Verification).
## Quality Standards
- No artifacts or quality degradation
- Clear dialogue, balanced audio levels
- Narrative flow enhancement
- Accurate intent interpretation
## Reference Library
- `references/edl-generation.md`: How to structure complex edits.
- `references/frame-analysis.md`: Visual inspection protocols.
- `references/ffmpeg-recipes.md`: (Common commands - *to be populated*)
This skill handles video editing, ffmpeg-based processing, and visual analysis to turn raw footage into polished deliverables. It encodes editing intents into reproducible workflows and enforces quality standards for color, motion, and audio. Use it to automate conversions, generate edit decision lists, and validate final outputs programmatically.
The skill inspects input media to extract specs (codec, frame rate, resolution) and performs frame-level visual analysis for shot detection, stabilization needs, and color grading cues. Natural language edit requests are translated into parameterized EDLs and ffmpeg command sequences which are executed to render edits. Outputs are verified with automated checks for artifacts, audio balance, and expected metadata.
What input formats and frame rates are supported?
The skill reads common codecs and containers and normalizes varied frame rates into target timelines; uncommon formats may require pre-transcoding.
How does verification work?
Verification runs programmatic checks: artifact detection, audio loudness and ducking validation, frame-count consistency, and metadata comparison against expected values.
Can I customize style presets?
Yes. Styles are exposed as parameter templates (color curves, LUTs, stabilization settings, cut pacing) that you can modify and reuse.