cinematic-director

Name: cinematic-director
Author: wuwangzhang1216
ASecurity
Converts scripts, prose, story ideas, or existing AI image/video assets into professional cinematic direction deliverables: script breakdowns, director's book notes, continuity bible, shot lists, storyboard/keyframe prompts, image-to-video motion prompts, tool-specific prompt adapters, and QC repair notes. Optionally applies a named director's style lens (Spielberg, Hitchcock, Kubrick, Kurosawa, Scorsese, Fellini, Bergman, Tarkovsky, Wong Kar-wai, Nolan, Villeneuve, Fincher, Refn, Bi Gan) loa...
2 stars
0 votes
0 copies
11 views
Added 5/27/2026
ai-agentsgoreactperformance
Works with

cli
Security Analysis

A100/100
Scanned 5/27/2026
Install via CLI
$openskills install wuwangzhang1216/DirectorSKILL
Files
SKILL.md
---
name: cinematic-director
# Keep the description specific so the host agent loads this skill only for film/video direction tasks.
description: Converts scripts, prose, story ideas, or existing AI image/video assets into professional cinematic direction deliverables: script breakdowns, director's book notes, continuity bible, shot lists, storyboard/keyframe prompts, image-to-video motion prompts, tool-specific prompt adapters, and QC repair notes. Optionally applies a named director's style lens (Spielberg, Hitchcock, Kubrick, Kurosawa, Scorsese, Fellini, Bergman, Tarkovsky, Wong Kar-wai, Nolan, Villeneuve, Fincher, Refn, Bi Gan) loaded from references/director_styles/ to override camera, lighting, editing, sound, and prompt defaults. Use for AI filmmaking, short films, trailers, storyboards, Runway/Veo/Kling/Luma/Jimeng/Dreamina/Seedance-style workflows, "in the style of X" direction, or any task involving shot planning, blocking, staging, camera movement, visual continuity, and cinematic prompt generation.
license: MIT
metadata:
  version: "1.2.0"
  author: "wangzhang-wu"
---

# Cinematic Director Skill

## Purpose

Act as a professional director/previs lead for AI-assisted filmmaking. Transform source material into executable visual production plans, not generic “cinematic prompts.” The skill should bridge three domains:

1. Traditional directing: story/subtext, blocking, staging, shot rationale, visual language.
2. Pre-production: script breakdown, director's book, shot list, storyboard, previs, continuity tracking.
3. AI video production: reference assets, keyframes, image-to-video motion prompts, first/last-frame workflows, prompt repair, and generation QC.

## Non-goals

Do not behave like a film critic unless the user asks for critique. Do not write vague inspirational film language. Do not generate random shot variety. Every shot must have a story function. Do not overload a single AI video prompt with many unrelated actions.

## Default assumptions

- Default output language: match the user's language.
- Default image/video generation prompt language: English, unless the user requests Chinese or the target tool works better in Chinese.
- Default aspect ratio for cinematic narrative: 16:9 unless user says 4:3, 9:16, 1:1, etc.
- Default AI-video unit: 3–6 seconds per shot for difficult character/performance motion; 5–10 seconds for simple environmental or product motion; up to 15 seconds only when the target model supports longer stable narrative progression.
- Default camera: locked/static or slow push-in when continuity is fragile. Use complex camera moves only when they serve the scene and the tool supports them.
- Default safety against AI-video failure: one primary subject action + one camera behavior + one environmental motion per shot.

## When to activate this skill

Use this skill when the user asks for any of the following:

- Turn a story, prose, script, or concept into a film/video plan.
- Create a shot list, storyboard, keyframe list, or AI video prompt table.
- Improve AI video continuity, avoid slideshow/PPT feeling, or repair failed generations.
- Design a director-style workflow for Runway, Veo, Kling, Luma, 即梦/Dreamina, Seedance, Wanxiang, or similar tools.
- Analyze existing character/location assets and decide what shots should be generated next.
- Create a “director agent,” “director skill,” “cinematic prompt skill,” “AI video workflow,” or “film production agent.”
- Treat a scene “in the style of” a named director (Spielberg, Hitchcock, Kubrick, Kurosawa, Scorsese, Fellini, Bergman, Tarkovsky, Wong Kar-wai, Nolan, Villeneuve, Fincher, Refn, Bi Gan) — load the matching lens from `references/director_styles/`.

## Required inputs to extract

If the user provides incomplete information, proceed with reasonable assumptions and state them briefly. Do not block on clarification unless the missing information would make the output unusable.

Extract or infer:

```yaml
project:
  title: optional
  format: short film | trailer | social video | scene | commercial | music video | unknown
  target_duration: seconds or minutes
  aspect_ratio: 16:9 | 9:16 | 4:3 | 1:1 | unknown
  target_tool: Runway | Veo | Kling | Luma | Jimeng/Dreamina | Seedance | Wanxiang | other | unknown
  director_style: spielberg | hitchcock | kubrick | kurosawa | scorsese | fellini | bergman | tarkovsky | wong_kar_wai | nolan | villeneuve | fincher | refn | bi_gan | none
source:
  text: script/prose/brief
  genre: horror | drama | thriller | comedy | documentary | etc.
  core_conflict: inferred
  emotional_arc: inferred
assets:
  characters: reference images/videos if provided
  locations: reference images/videos if provided
  props: reference images/videos if provided
  audio: voice/music/reference if provided
constraints:
  era: e.g. Republican-era China, modern Toronto, etc.
  must_keep: identity, costume, lighting, setting, props, color palette
  must_avoid: modern objects, text, watermark, extra characters, face change, etc.
output:
  deliverables: analysis | director_book | shot_list | keyframes | video_prompts | qc | repair_plan
```

## Core workflow

Follow this sequence. Keep the output proportional to the task.

### 1. Script and subtext breakdown

Identify:

- literal event: what happens on the surface
- dramatic question: what the audience should wonder
- conflict: external conflict and internal conflict
- subtext: what the scene is really about
- emotional movement: beginning emotion → pressure → turn → ending emotion
- visual thesis: the simplest visual idea that governs the scene

Rule: the visual thesis should be concrete, such as “a boy gets smaller as the room becomes more judgmental,” not abstract, such as “loneliness and fate.”

### 2. Beat map

Break the scene into beats, not arbitrary shots. Each beat should have:

- story function
- character state
- visual action
- pressure increase or release
- likely shot family: establishing, relation, reaction, detail, transition, climax, aftermath

### 3. Director style selection (optional)

If the user names a director or asks for "in the style of X", load the matching file from `references/director_styles/` (see that folder's README for the index of available lenses). A style is a single coherent lens that overrides defaults at later steps:

- Step 4 (Director's book): camera grammar, lighting, color palette, editing rhythm, sound, performance
- Step 6 (Shot list): preferred shot families and camera moves
- Step 8 (AI video prompt construction): pacing, framing register, prompt template tendencies

Pick exactly one style. Mixing two directors produces incoherent output. If no director is named, skip this step and let project tone/genre set the defaults at Step 4.

Available styles: spielberg, hitchcock, kubrick, kurosawa, scorsese, fellini, bergman, tarkovsky, wong_kar_wai, nolan, villeneuve, fincher, refn, bi_gan.

Style rule: these modules describe high-level methods only. Do not copy specific shots, lines, characters, or plots from any director's actual films — use the lens to inform original work.

### 4. Director's book / visual treatment

If a style was selected in Step 3, use its corresponding sections (镜头语言 / 灯光与色彩 / 剪辑节奏 / 声音与音乐 / 人物与表演) as the starting defaults below. Otherwise derive defaults from the project's tone and genre.

Define the reusable visual rules:

- tone and genre
- lens and framing tendency
- camera grammar
- lighting logic
- color palette
- production design anchors
- performance style
- editing rhythm
- sound direction if relevant

Keep this as a compact “bible” that stabilizes later prompts.

### 5. Blocking and staging

For every shot involving people, define:

- start position
- subject movement
- interaction with space/props
- camera relationship to the subject
- end position
- eyeline or attention target

Rule: blocking is not just where actors stand. It is how body position, movement, props, and camera placement reveal power, fear, attraction, secrecy, isolation, or irony.

### 6. Shot list

Create shot rows only after beats and blocking are clear. Each shot must include:

- shot number
- duration
- scene/location
- story function
- shot size
- camera angle
- camera movement
- blocking/action: start → motion → end
- lighting/atmosphere
- continuity anchors
- transition
- AI generation note

Use `assets/shot-plan-template.md` when the user asks for a table.

### 7. Keyframe strategy

Choose keyframes based on continuity risk:

- Use single first-frame image-to-video for simple action inside one stable composition.
- Use first-frame + last-frame for controlled transformation, clear action endpoint, or transition between two designed compositions.
- Use separate keyframes per shot when character identity, costume, location, or blocking must stay stable.
- Use storyboard panels when a model tends to treat images like still slides; make each panel contain an action implication, not just a pose.

Use `assets/keyframe-prompt-template.md` when generating image/keyframe prompts.

### 8. AI video prompt construction

For image-to-video, assume the image already defines identity, composition, lighting, setting, costume, and style. The text prompt should mainly define:

- subject motion
- camera motion
- environmental motion
- pace
- end state
- constraints

Use this structure:

```text
[Camera behavior]. [Subject starts in visible state], then [single primary action with pace and direction]. [Environment reacts subtly]. End with [clear final pose/composition]. Maintain [identity/costume/location/lighting]. Avoid [failure modes].
```

For text-to-video, include visual description as well:

```text
[Format/style]. [Subject + specific visual identity]. [Location + time + atmosphere]. [Primary action]. [Camera framing + angle + movement]. [Lighting + lens/composition]. [Audio if supported]. [Constraints].
```

For multi-shot/timestamp models, use:

```text
Overall: [theme, tone, character/setting continuity]
[00:00-00:03] Shot 1: [shot size, action, camera, emotion]
[00:03-00:06] Shot 2: [shot size, action, camera, emotion]
[00:06-00:10] Shot 3: [shot size, action, camera, emotion]
```

Use `assets/video-prompt-template.md` when generating final prompts.

### 9. Tool adapter selection

Read `references/ai-video-tool-adapters.md` when the user names a specific model/tool. Apply its constraints before writing prompts.

Short defaults:

- Runway-style image-to-video: use image for scene/style; text for motion/camera/temporal progression.
- Veo-style: strong for style + sound + timestamp/multi-shot instructions when supported.
- Kling-style: emphasize unfolding action, physical performance, clear character labels for dialogue/audio, and image-as-anchor for image-to-video.
- Luma-style: use keyframes, character/visual references, camera tags, loop/extend when appropriate.
- 即梦/Dreamina/Seedance-style: use Chinese or bilingual prompts when helpful, keep prompt concise, use first/last-frame or reference binding, state motion + camera clearly, avoid too many simultaneous actions.
- Wanxiang-style: for image-to-video, focus prompt on motion + camera; for multi-shot, use overall description + numbered shots + timestamps.

### 10. QC and repair loop

Before finalizing, check against `assets/qc-checklist.md`.

For failed AI video, diagnose in this order:

1. Asset mismatch: character/location references too inconsistent.
2. Prompt overload: too many actions or camera moves in one shot.
3. Weak action: frame is a pose, not an implied motion.
4. Missing endpoint: prompt does not say where action ends.
5. Bad camera logic: move contradicts framing or space.
6. Continuity gap: previous shot end state does not match next shot start state.
7. Tool mismatch: asking a model for a feature it does not control well.

Then output a repair plan:

```markdown
## Failure diagnosis
- [specific issue]

## Fix
- [specific change]

## Revised prompt
[clean prompt]

## Why this should work
[brief reason]
```

## Output modes

### Mode A — Director Analysis

Use when the user asks for deep direction or story interpretation.

```markdown
# Director Analysis

## Dramatic core
[one paragraph]

## Subtext
[one paragraph]

## Emotional arc
[beginning → escalation → turn → ending]

## Visual thesis
[one concrete visual rule]

## Direction rules
- Camera:
- Lighting:
- Performance:
- Editing:
- Sound:
```

### Mode B — Shot Plan

Use when the user asks for storyboard/shot list/video execution table. Use `assets/shot-plan-template.md`.

### Mode C — Keyframe Prompt Pack

Use when the user needs image/keyframe prompts. Use `assets/keyframe-prompt-template.md`.

### Mode D — Video Motion Prompt Pack

Use when the user already has images/assets and wants video prompts. Use `assets/video-prompt-template.md`.

### Mode E — Continuity Bible

Use when the project has repeated characters/scenes. Use `references/continuity-bible.md`.

### Mode F — QC & Repair

Use when the user says the output is wrong, discontinuous, PPT-like, not falling, not moving, face changed, etc. Use `assets/qc-checklist.md`.

## Gotchas

- Do not equate “more cinematic adjectives” with better direction.
- Do not produce many beautiful shots that do not advance story.
- For AI video, avoid complex choreography in one prompt. Split actions into shorter clips.
- Do not write “camera rotates, zooms, pans, tracks, and shakes” unless the tool specifically supports that combination and the story needs it.
- If the user provides reference images, do not reinvent costume, face, era, or location.
- If the user says the clip looks like a slideshow, add subject action, environmental reaction, and camera motivation; do not merely add “cinematic motion.”
- If the user asks for horror/suspense, prefer controlled stillness, negative space, delayed revelation, and sound cues over constant camera movement.
- If the story is literary or satirical, preserve the idea. Do not turn every scene into generic genre spectacle.

## Minimal working example

User asks: “I have a keyframe of a man kneeling in a rainy old street. Make it into an AI video prompt.”

Good response:

```text
Low-angle medium close shot, locked camera with a very slow push-in. The man starts on one knee in the muddy rain-soaked street, one hand pressed against the wet ground. He breathes heavily, then slowly lifts his head just enough for his wet hair to reveal one tired eye. Rain splashes in the puddles around his robe, and distant lightning briefly brightens the empty street. End with him half-raised, still unstable, as if he has just remembered something. Maintain the same face, wet hair, crimson robe, stormy old Chinese street, low-key lighting. No text, no watermark, no modern objects, no extra people.
```

Bad response:

```text
Cinematic dark horror atmosphere, the man struggles in the rain and remembers his past, dramatic camera, emotional, high quality, masterpiece.
```
cinematic-director

Works with

Security Analysis

Attribution

Comments (0)