video-production-ledger

📁 bmcgauley/skills 📅 6 days ago
4
总安装量
3
周安装量
#48654
全站排名
安装命令
npx skills add https://github.com/bmcgauley/skills --skill video-production-ledger

Agent 安装分布

opencode 3
claude-code 3
github-copilot 3
codex 3
kimi-cli 3
gemini-cli 3

Skill 文档

Video Production Master Ledger

Transform content outlines and scripts into comprehensive, frame-by-frame production master ledgers that serve as the complete blueprint for video production and post-production.

Purpose and Overview

The Video Production Master Ledger system converts scripts and outlines into detailed production documents that specify every aspect of a video’s creation. Unlike simple shot lists or rough storyboards, master ledgers provide:

  • Frame-accurate timing for every visual and audio element
  • Multiple alternatives for A-roll, B-roll, and visual treatments
  • Complete audio specification including music, SFX, risers, hits, and drones
  • Viewer attention mapping showing exactly where focus should be directed
  • Engagement mechanics tied to the four-pillar framework
  • Technical specifications for effects, transitions, color grading, and motion

When to Use This Skill

Use this skill when:

  • Converting video scripts or outlines into production-ready specifications
  • Planning video content that requires precise engagement optimization
  • Creating detailed shot lists with timing, alternatives, and technical specs
  • Breaking down educational, entertainment, or marketing videos into actionable production plans
  • Documenting complete visual and audio workflows for video editors
  • Establishing production standards for consistent video quality

Core Framework: The Four Pillars of Addictive Editing

All production decisions in the master ledger align with these engagement principles:

Pillar 1: Healthy Variety of Visuals

Principle: Change what viewers see every few seconds to maintain attention without causing visual confusion.

Implementation:

  • Alternate between A-roll (subject on camera), B-roll (supplementary footage), motion graphics, and text overlays
  • Maintain 3-10 second shot durations for engaging content
  • Use longer durations (10+ seconds) only when footage inherently holds attention
  • Specify exact visual type and duration for each segment

Pillar 2: Visual Continuity

Principle: Create seamless visual flow where every element blends naturally into the next.

Implementation:

  • All graphics must animate into frame (never magically appear)
  • Maintain consistent focal points across cuts to avoid viewer eye jumping
  • Use transitions strategically to mark topic changes
  • Specify animation style (slide, fade, scale) for every graphic element

Pillar 3: Immersive Audio

Principle: Layer sound design to create double stimulation and emotional guidance.

Implementation:

  • Base music layer establishing mood for each segment
  • Movement sounds (whoosh) for all animated elements
  • Emphasis sounds (hits, highlights) for important moments
  • Emotional manipulation through risers (build tension), hits (release tension), and drones (create mystery)
  • Music synchronization with visual beats and topic transitions

Pillar 4: Healthy Pacing

Principle: Balance stimulation with comprehension, adjusting density to content type and audience expectation.

Implementation:

  • Entertainment content: Higher cut frequency, constant stimulation
  • Educational content: Moderate pacing with strategic emphasis
  • Authentic/vlog content: Minimal cuts, preserving natural flow
  • Specify exact pacing strategy for each segment

Master Ledger Structure

Document Format

Each master ledger is organized into these sections:

  1. Project Overview – High-level video metadata and strategy
  2. Segment Breakdown – Frame-by-frame specification of every moment
  3. Audio Architecture – Complete sound design mapping
  4. Visual Asset Requirements – Comprehensive list of needed footage/graphics
  5. Technical Specifications – Software settings, export specs, quality standards

Segment Specification Template

For each segment in the video, specify:

SEGMENT [Number]: [Title/Description]
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

TIMING
├─ Start: [MM:SS.ms]
├─ End: [MM:SS.ms]  
└─ Duration: [seconds]

SCRIPT/VOICEOVER
[Exact words spoken, with emphasis markers]

VISUAL PRIMARY
├─ Type: [A-roll / B-roll / Motion Graphic / Text / Hybrid]
├─ Description: [Detailed description of what viewer sees]
├─ Focal Point: [Where viewer eye should focus - coordinates or description]
├─ Duration on Screen: [seconds]
└─ Technical Notes: [Camera angle, framing, specific visual requirements]

VISUAL ALTERNATIVES (minimum 3)
Alt 1: [Type] - [Description] - [Why this works]
Alt 2: [Type] - [Description] - [Why this works]  
Alt 3: [Type] - [Description] - [Why this works]

VISUAL EFFECTS & TREATMENTS
├─ Color Grade: [Mood/style - e.g., "Warm, slightly desaturated"]
├─ Motion: [Scale/position animation - e.g., "Slow push-in, 100% to 105% over 3s"]
├─ Overlays: [Captions, graphics, vignettes - with timing]
├─ Transitions: [Type, duration, and justification]
└─ Focus Direction: [Methods - darken/blur/highlight/circle/arrow/animate]

AUDIO LAYERS
├─ Music: [Track name/mood] at [volume %] 
│   ├─ Sync Points: [Where music hits match visual beats]
│   └─ Fades: [In/out timing and duration]
├─ Dialogue: [Clean/processed, volume %, any effects]
├─ SFX Movement: [Whoosh/swipe sounds for animations] @ [timing]
├─ SFX Emphasis: [Hit/pop sounds for highlights] @ [timing]
├─ SFX Emotional: [Risers/drones] @ [timing] for [purpose]
└─ Ambient: [Background atmosphere if needed]

ENGAGEMENT MECHANICS
├─ Attention Hook: [What grabs viewer in first 0.5s]
├─ Viewer Emotion: [Target emotional state]
├─ Pillar Focus: [Which of 4 pillars is prioritized]
└─ Retention Strategy: [How this moment prevents drop-off]

EDITOR NOTES
[Any specific guidance, common pitfalls, or creative direction]

Production Workflow

Phase 1: Script Analysis and Strategic Planning

Input Required:

  • Complete video script or detailed outline
  • Target audience profile
  • Video purpose (entertainment/education/marketing/vlog)
  • Desired video length
  • Any existing brand guidelines or style preferences

Process:

  1. Read the complete script/outline
  2. Identify natural segment boundaries (topic shifts, tone changes)
  3. Determine primary engagement strategy based on content type
  4. Map emotional arc across the entire video
  5. Establish pacing rhythm (where to intensify, where to breathe)

Output:

  • Project Overview document with strategy summary
  • Segment list with high-level descriptions
  • Emotional/energy graph showing intensity over time

Phase 2: Detailed Segment Specification

For each segment identified in Phase 1:

Step 1: Timing Allocation

  • Assign precise start/end timestamps based on script reading speed
  • Account for natural pauses and emphasis
  • Build in breathing room between segments

Step 2: Visual Planning

  • Define primary visual type (A-roll, B-roll, graphic)
  • Create 3-5 alternative visual approaches
  • Specify focal points to guide viewer attention
  • Plan visual variety rhythm within segment

Step 3: Audio Layering

  • Select music that matches segment emotion
  • Identify all moments needing SFX
  • Plan riser/hit/drone placement for emotional manipulation
  • Sync audio peaks with visual moments

Step 4: Engagement Optimization

  • Apply four-pillar framework to each moment
  • Identify potential drop-off points and add hooks
  • Balance stimulation density with comprehension needs
  • Verify continuity with previous/next segments

Step 5: Technical Specification

  • Define all effects, color grades, motion parameters
  • Specify transition types and durations
  • List overlay timing and animation styles
  • Note any special technical requirements

Output: Complete segment specification using template format

Phase 3: Asset Requirements Compilation

Process:

  1. Extract all unique visual assets needed from segment specs
  2. Categorize by type (A-roll shots, B-roll footage, graphics, stock)
  3. Prioritize based on importance and difficulty to obtain
  4. Note alternatives if primary assets unavailable

Output:

  • Organized asset list with descriptions
  • Shot list for A-roll filming
  • B-roll footage requirements (film vs. stock)
  • Motion graphic briefs
  • Stock footage search queries

Phase 4: Technical Specification Sheet

Process:

  1. Compile all software requirements
  2. Establish quality standards (resolution, bitrate, format)
  3. Document color grading approach
  4. List required plugins, fonts, sound libraries
  5. Define export settings

Output: Technical specification document covering:

  • Software and versions
  • Project settings
  • Quality standards
  • Required assets (sounds, fonts, plugins)
  • Export specifications

Detailed Specification Guidelines

Visual Specification Standards

A-Roll Specifications Must Include:

  • Camera angle and framing (wide/medium/close-up)
  • Subject position in frame
  • Lighting mood
  • Background treatment (blur amount, color)
  • Eye contact point for viewer focus continuity
  • Duration and whether subject is static or moving

B-Roll Specifications Must Include:

  • Exact content description
  • Required motion (pan, tilt, static, handheld)
  • Color treatment to match segment mood
  • Specific visual details that matter to narrative
  • Duration and if looped
  • Relationship to voiceover (illustrative/metaphorical/explanatory)

Motion Graphic Specifications Must Include:

  • Content type (diagram, text, chart, icon, illustration)
  • Animation style (fade/slide/scale/perspective/custom)
  • Entry and exit timing and method
  • Color palette and relationship to brand
  • Complexity level (simple icon vs. detailed diagram)
  • Key information and visual hierarchy

Text Overlay Specifications Must Include:

  • Exact text content
  • Font, size, weight, color
  • Animation in/out (with timing)
  • Position on screen
  • Duration on screen
  • Emphasis techniques (color shift, scale, glow)
  • Maximum 3 words at once for readability

Audio Specification Standards

Music Selection Criteria:

  • Primary emotion (anticipation, excitement, mystery, comfort, intensity)
  • Energy level (1-10 scale)
  • Tempo range (BPM)
  • Instrumentation character (electronic, organic, hybrid)
  • Sync points with visual beats
  • Volume levels for each segment
  • Fade in/out timing (sudden, quick, gradual)

Sound Effect Categories:

  1. Movement SFX – For every animated element

    • Whoosh variations (light, medium, heavy)
    • Swipe sounds for transitions
    • Slide sounds for moving graphics
    • Timing: Sync with visual motion start
  2. Emphasis SFX – For important moments

    • Pop sounds for appearing elements
    • Hit sounds for impactful statements
    • Click sounds for interface elements
    • Timing: Exact frame alignment critical
  3. Emotional SFX – For mood manipulation

    • Risers (duration 0.5-3s) building into important moments
    • Hits (sharp, immediate) releasing tension at reveals
    • Drones (sustained, 5-30s) creating underlying atmosphere
    • Timing: Strategic placement based on emotional arc
  4. Ambient SFX – For environmental context

    • Background atmosphere appropriate to B-roll
    • Volume: Subtle, typically 10-30% of dialogue
    • Duration: Continuous through related segments

Timing Specification Standards

Timestamp Format: MM:SS.ms (00:00.000)

Timing Precision Requirements:

  • Segment boundaries: Frame-accurate (±0.033s)
  • SFX placement: Frame-accurate
  • Music sync points: Frame-accurate
  • Visual cuts: Frame-accurate
  • Text appearance: Within 0.1s
  • Transitions: Exact duration specified

Reading Speed Standards:

  • Casual speaking: 150-160 words per minute
  • Enthusiastic/fast: 170-190 words per minute
  • Measured/serious: 130-140 words per minute
  • Calculate segment duration by word count ÷ speaking rate

Engagement Mechanics Specification

For each segment, explicitly define:

Attention Hook – First 0.5s element that captures focus

  • Options: Sudden motion, contrasting color, unexpected sound, text emphasis, scale change
  • Specify exact technique and timing

Viewer Emotion Target – Desired emotional state

  • Categories: Curious, excited, informed, empowered, entertained, contemplative, energized
  • Link to music and pacing choices

Primary Pillar – Which engagement pillar is emphasized

  • Variety: When introducing new information or building energy
  • Continuity: When explaining complex ideas requiring focus
  • Audio: When manipulating emotion or building anticipation
  • Pacing: When matching audience expectations for content type

Retention Strategy – Specific technique preventing drop-off

  • Pattern interruption (sudden change after repetition)
  • Curiosity gap (promising information to come)
  • Emotional payoff (delivering satisfying conclusion)
  • Momentum building (increasing energy toward next segment)

Reference Files

Audio Element Library Reference

For detailed sound effect selection guidance and categorization, see:

  • references/audio_elements_guide.md – Comprehensive SFX library organization
  • references/music_mood_matrix.md – Music selection framework by emotional target

Visual Treatment Library Reference

For specific effect implementation and parameter recommendations, see:

  • references/visual_effects_catalog.md – Complete effects library with use cases
  • references/motion_graphics_templates.md – Standard animation patterns

Example Master Ledgers

For complete production ledger examples across different video types, see:

  • references/example_educational_ledger.md – Detailed educational content breakdown
  • references/example_entertainment_ledger.md – High-energy entertainment specification
  • references/example_vlog_ledger.md – Authentic vlog minimal production approach

Quality Standards

Completeness Checklist

A production-ready master ledger must include:

  • Complete timing for every segment (no gaps)
  • Primary visual specified for every moment
  • Minimum 3 alternatives per primary visual
  • Audio layers defined for every segment
  • At least one SFX per 5 seconds of content
  • Music track specified with volume and sync points
  • Focal point identified for every visual
  • Engagement mechanics defined per segment
  • All transitions specified with type and duration
  • Technical specifications for all effects
  • Complete asset requirements list
  • Editor notes for complex or critical moments

Review Process

Before delivering master ledger:

  1. Timing Review – Verify no gaps, overlaps, or timing conflicts
  2. Continuity Review – Check focal point flow across cuts
  3. Audio Sync Review – Confirm music hits align with visual moments
  4. Alternative Viability – Ensure all alternatives are genuinely usable
  5. Technical Feasibility – Verify all specifications are achievable
  6. Engagement Density – Confirm attention maintenance throughout
  7. Brand Alignment – Check consistency with any provided guidelines

Output Format

Deliver master ledgers as structured markdown documents with:

  1. Project Overview Section at top
  2. Segment Specifications in chronological order
  3. Audio Architecture Map showing all music and SFX timing
  4. Visual Asset Requirements comprehensive list
  5. Technical Specifications complete software/export details

Use consistent formatting with clear hierarchy:

  • H1 for document title
  • H2 for major sections
  • H3 for segment titles
  • Clear visual dividers between segments
  • Code blocks or tables for technical specifications
  • Emphasis formatting for critical notes

Advanced Techniques

Sync Mapping for Music-Driven Segments

When segment energy should match music perfectly:

  1. Load music track and identify beat/measure boundaries
  2. Align segment start with strong beat or measure start
  3. Map visual hits (cuts, reveals, text) to music hits
  4. Place risers before major music builds
  5. Time transitions to finish on beat resolutions

Emotional Arc Continuity

Maintain emotional flow across segments:

  1. Map desired emotional intensity (1-10) for each segment
  2. Never jump more than 3 points between segments
  3. Use transition type to bridge emotional gaps:
    • Small gap (1-2 points): Simple cut
    • Medium gap (2-3 points): Quick transition with SFX
    • Large gap (4+ points): Requires intermediary segment

Viewer Attention Heatmapping

Predict where viewer attention goes:

  1. Mark primary focal point for each visual
  2. Track eye position across cuts
  3. Flag any jump larger than 30% of screen width
  4. Add guidance elements (motion, highlight) when jumps occur
  5. Maintain focal continuity for important information

Cognitive Load Management

Balance information delivery with engagement:

  1. Rate each segment for cognitive demand (1-10)
  2. High-demand segments (7-10): Reduce visual variety, maintain continuity
  3. Low-demand segments (1-3): Increase variety, add stylistic elements
  4. Medium-demand segments (4-6): Balance based on audience sophistication
  5. Never exceed sustained cognitive load of 8 for more than 30 seconds

Common Patterns and Templates

Opening Hook Pattern (First 5-10 seconds)

SEGMENT 1: HOOK
Timing: 00:00.000 - 00:08.000

Visual: Fast-paced montage of compelling B-roll
- 5-7 quick cuts (1-2s each)
- High-energy visuals showing end result or transformation
- Each cut must contain movement or strong visual interest

Audio:
- Energetic music with strong beat
- No dialogue or minimal teaser dialogue
- Synchronized cutting to music beats
- Optional riser building toward title card

Purpose: Immediate engagement, sets energy level, promises value

Explanation Pattern (Teaching Complex Concept)

SEGMENT X: CONCEPT EXPLANATION
Timing: [Start] - [End]

Visual Primary: Motion graphic breaking down concept
- Start simple, build complexity progressively
- Use 3-5 second pauses at each build step
- Highlight new elements with color/animation
- Keep existing elements visible but de-emphasized

Visual Support: Picture-in-picture A-roll in corner
- Shows presenter explaining, builds connection
- Small enough to not distract from main graphic
- Can be removed during complex graphic moments

Audio:
- Music: Lower energy, more contemplative
- Volume: 30% during explanation, 50% during pauses
- SFX: Highlight sound for each new element introduced
- No risers/hits that might distract from learning

Purpose: Clear understanding, retention of complex information

Transition Between Topics Pattern

SEGMENT X: TOPIC TRANSITION
Timing: [Start] - [End] (typically 1-3 seconds)

Visual: Full-screen transition effect
- Type matches energy level and topic relationship
- Duration: 0.5-1.5s for related topics, 1-3s for major shifts
- Can include text card announcing new topic

Audio:
- Music: Fades out end of Topic A, fades in for Topic B
- Optional: Hit sound at transition point
- If major shift: Brief moment of silence (0.2-0.5s)

Purpose: Signal change, reset attention, prepare viewer for new context

Troubleshooting Common Issues

Issue: Too Much Visual Variety (Viewer Confusion)

Symptoms: Many cuts under 2 seconds, constant change without purpose, no focal point consistency

Solutions:

  • Extend shots with inherent interest to 5-8 seconds
  • Group related information with single visual
  • Establish consistent focal point across 2-3 related cuts
  • Reduce transition effects

Issue: Too Little Visual Variety (Viewer Boredom)

Symptoms: Long stretches without visual change, static shots over 10 seconds, repetitive visual patterns

Solutions:

  • Add motion graphics overlay to static shots
  • Implement slow push-in or pan on still shots
  • Cut to related B-roll every 5-7 seconds
  • Introduce text overlays for key points

Issue: Audio-Visual Desynchronization

Symptoms: Music hits not matching visual moments, SFX feeling detached, emotional tone mismatch

Solutions:

  • Map music beat grid and align cuts to beats
  • Place all SFX at exact frame of visual event
  • Adjust music track timing or find better sync point
  • Consider music edit (splice) if structural mismatch exists

Issue: Cognitive Overload

Symptoms: Too much happening simultaneously, competing focal points, insufficient processing time

Solutions:

  • Reduce SFX density in information-heavy segments
  • Extend duration of complex motion graphics
  • Remove text overlays during B-roll storytelling
  • Simplify color grading to reduce visual noise

Issue: Engagement Drop-off at Specific Points

Symptoms: Predictable viewer exit points, long segments without hooks, monotonous pacing

Solutions:

  • Add pattern interruption (sudden change) every 8-10 seconds
  • Increase music energy leading into weak segment
  • Insert compelling B-roll or motion graphic
  • Add riser before important information to build anticipation

Integration with Production Pipeline

Pre-Production Phase

Master ledger informs:

  • Shot list creation for filming
  • B-roll footage requirements
  • Motion graphics briefs for designers
  • Music licensing and selection
  • SFX library acquisition

Production Phase

Master ledger guides:

  • A-roll filming priorities and setup
  • B-roll capture shopping list
  • On-set timing considerations
  • Contingency planning for missed shots

Post-Production Phase

Master ledger serves as:

  • Editor’s primary reference document
  • Timeline structure blueprint
  • Audio mix specification
  • QA checklist for completeness
  • Revision communication tool

Best Practices

Do’s

✅ Specify alternatives: Always provide 3-5 visual options per segment ✅ Time precisely: Use frame-accurate timestamps for all elements ✅ Layer audio: Multiple audio elements create richness ✅ Guide focus: Explicitly state where viewer eye should be ✅ Connect emotion: Link all creative choices to target emotional state ✅ Plan continuity: Consider flow across cuts, not just within shots ✅ Justify choices: Explain “why” in editor notes when non-obvious ✅ Build flexibility: Structure allows substitutions without breaking flow

Don’ts

❌ Don’t under-specify: “Add B-roll here” is insufficient – describe exactly what B-roll ❌ Don’t ignore timing: Vague durations lead to pacing problems ❌ Don’t forget audio: Visual-only specs miss half the engagement equation
❌ Don’t neglect alternatives: Single-option specs fail when assets unavailable ❌ Don’t break continuity: Jumping focal points destroy immersion ❌ Don’t over-complicate simple moments: Not every second needs maximum stimulation ❌ Don’t contradict pillars: Every choice should support engagement framework ❌ Don’t create in isolation: Each segment must connect to adjacent segments

Success Metrics

A high-quality master ledger enables:

  • Editor efficiency: 50%+ faster editing with clear specifications
  • First-cut accuracy: 80%+ of choices match creative intent
  • Asset optimization: Zero missing assets, minimal unused footage
  • Revision reduction: Clear specs reduce back-and-forth iterations
  • Quality consistency: Systematic approach ensures professional results
  • Scalability: Framework applies to videos of any length or type

Conclusion

The Video Production Master Ledger transforms the editing process from creative guesswork into systematic execution of a proven engagement framework. By specifying every visual, audio, timing, and technical element with precision and alternatives, the master ledger ensures editors can create compelling, attention-holding content that serves the video’s purpose while maintaining production efficiency.

The four-pillar framework underlying all specifications—visual variety, visual continuity, immersive audio, and healthy pacing—provides a unified theory of viewer engagement that guides every creative decision from conceptual outline to final export.