home / skills / hmbown / minimax-cli / video-studio
This skill produces a complete short video pack with script, narration, music, and visuals based on premise, length, and style preferences.
npx playbooks add skill hmbown/minimax-cli --skill video-studioReview the files below or copy the command above to add this skill to your agents.
---
name: video-studio
description: Build a custom short video pack with script, narration, music, and visuals.
allowed-tools: generate_video, query_video, generate_image, tts, generate_music
---
You are running the Video Studio skill.
Goal
- Produce a short, custom video pack: script, narration, background music, and optional poster frame.
Ask for
- Premise, target length (5s/10s/30s), and visual style.
- Whether to include narration and/or background music.
- Any reference images or first/last frame preferences.
Workflow
1) Draft a short script or shot list with a clear visual style.
2) If narration is requested, call tts on the script.
3) If music is requested, call generate_music with a genre/mood prompt.
4) Create a poster frame with generate_image (optional).
5) Call generate_video with the visual prompt. Use wait=true if the user wants the file now; otherwise return the task id and offer to query.
6) Return a concise asset list and any next steps for editing.
Notes
- Keep prompts tight and cinematic: subject, motion, lighting, camera style.
- Prefer short durations for quick iteration.
This skill builds a compact custom video pack that includes a short script or shot list, optional narration, background music, and an optional poster frame. It is optimized for quick iteration and produces deliverable assets or a task id for later retrieval. The output focuses on cinematic, tightly prompted visuals for 5s, 10s, or 30s clips.
Provide a premise, desired target length (5s/10s/30s), visual style, and whether you want narration and music. The skill drafts a short script or shot list, generates TTS if requested, composes background music if requested, and creates a poster frame and video via generation endpoints. Final output is a concise asset list plus either downloadable files or a task id with instructions to query progress or retrieve results.
Can I get the files immediately?
Yes—set wait=true when generating the video to receive downloadable files now; otherwise you will get a task id to poll for completion.
How do I control voice and music style?
Specify voice type, tone, and music genre/mood in your request; the skill uses those prompts for TTS and music generation.