video-production-ledger
npx skills add https://github.com/bmcgauley/skills --skill video-production-ledger
Agent 安装分布
Skill 文档
Video Production Master Ledger
Transform content outlines and scripts into comprehensive, frame-by-frame production master ledgers that serve as the complete blueprint for video production and post-production.
Purpose and Overview
The Video Production Master Ledger system converts scripts and outlines into detailed production documents that specify every aspect of a video’s creation. Unlike simple shot lists or rough storyboards, master ledgers provide:
- Frame-accurate timing for every visual and audio element
- Multiple alternatives for A-roll, B-roll, and visual treatments
- Complete audio specification including music, SFX, risers, hits, and drones
- Viewer attention mapping showing exactly where focus should be directed
- Engagement mechanics tied to the four-pillar framework
- Technical specifications for effects, transitions, color grading, and motion
When to Use This Skill
Use this skill when:
- Converting video scripts or outlines into production-ready specifications
- Planning video content that requires precise engagement optimization
- Creating detailed shot lists with timing, alternatives, and technical specs
- Breaking down educational, entertainment, or marketing videos into actionable production plans
- Documenting complete visual and audio workflows for video editors
- Establishing production standards for consistent video quality
Core Framework: The Four Pillars of Addictive Editing
All production decisions in the master ledger align with these engagement principles:
Pillar 1: Healthy Variety of Visuals
Principle: Change what viewers see every few seconds to maintain attention without causing visual confusion.
Implementation:
- Alternate between A-roll (subject on camera), B-roll (supplementary footage), motion graphics, and text overlays
- Maintain 3-10 second shot durations for engaging content
- Use longer durations (10+ seconds) only when footage inherently holds attention
- Specify exact visual type and duration for each segment
Pillar 2: Visual Continuity
Principle: Create seamless visual flow where every element blends naturally into the next.
Implementation:
- All graphics must animate into frame (never magically appear)
- Maintain consistent focal points across cuts to avoid viewer eye jumping
- Use transitions strategically to mark topic changes
- Specify animation style (slide, fade, scale) for every graphic element
Pillar 3: Immersive Audio
Principle: Layer sound design to create double stimulation and emotional guidance.
Implementation:
- Base music layer establishing mood for each segment
- Movement sounds (whoosh) for all animated elements
- Emphasis sounds (hits, highlights) for important moments
- Emotional manipulation through risers (build tension), hits (release tension), and drones (create mystery)
- Music synchronization with visual beats and topic transitions
Pillar 4: Healthy Pacing
Principle: Balance stimulation with comprehension, adjusting density to content type and audience expectation.
Implementation:
- Entertainment content: Higher cut frequency, constant stimulation
- Educational content: Moderate pacing with strategic emphasis
- Authentic/vlog content: Minimal cuts, preserving natural flow
- Specify exact pacing strategy for each segment
Master Ledger Structure
Document Format
Each master ledger is organized into these sections:
- Project Overview – High-level video metadata and strategy
- Segment Breakdown – Frame-by-frame specification of every moment
- Audio Architecture – Complete sound design mapping
- Visual Asset Requirements – Comprehensive list of needed footage/graphics
- Technical Specifications – Software settings, export specs, quality standards
Segment Specification Template
For each segment in the video, specify:
SEGMENT [Number]: [Title/Description]
âââââââââââââââââââââââââââââââââââââââââ
TIMING
ââ Start: [MM:SS.ms]
ââ End: [MM:SS.ms]
ââ Duration: [seconds]
SCRIPT/VOICEOVER
[Exact words spoken, with emphasis markers]
VISUAL PRIMARY
ââ Type: [A-roll / B-roll / Motion Graphic / Text / Hybrid]
ââ Description: [Detailed description of what viewer sees]
ââ Focal Point: [Where viewer eye should focus - coordinates or description]
ââ Duration on Screen: [seconds]
ââ Technical Notes: [Camera angle, framing, specific visual requirements]
VISUAL ALTERNATIVES (minimum 3)
Alt 1: [Type] - [Description] - [Why this works]
Alt 2: [Type] - [Description] - [Why this works]
Alt 3: [Type] - [Description] - [Why this works]
VISUAL EFFECTS & TREATMENTS
ââ Color Grade: [Mood/style - e.g., "Warm, slightly desaturated"]
ââ Motion: [Scale/position animation - e.g., "Slow push-in, 100% to 105% over 3s"]
ââ Overlays: [Captions, graphics, vignettes - with timing]
ââ Transitions: [Type, duration, and justification]
ââ Focus Direction: [Methods - darken/blur/highlight/circle/arrow/animate]
AUDIO LAYERS
ââ Music: [Track name/mood] at [volume %]
â ââ Sync Points: [Where music hits match visual beats]
â ââ Fades: [In/out timing and duration]
ââ Dialogue: [Clean/processed, volume %, any effects]
ââ SFX Movement: [Whoosh/swipe sounds for animations] @ [timing]
ââ SFX Emphasis: [Hit/pop sounds for highlights] @ [timing]
ââ SFX Emotional: [Risers/drones] @ [timing] for [purpose]
ââ Ambient: [Background atmosphere if needed]
ENGAGEMENT MECHANICS
ââ Attention Hook: [What grabs viewer in first 0.5s]
ââ Viewer Emotion: [Target emotional state]
ââ Pillar Focus: [Which of 4 pillars is prioritized]
ââ Retention Strategy: [How this moment prevents drop-off]
EDITOR NOTES
[Any specific guidance, common pitfalls, or creative direction]
Production Workflow
Phase 1: Script Analysis and Strategic Planning
Input Required:
- Complete video script or detailed outline
- Target audience profile
- Video purpose (entertainment/education/marketing/vlog)
- Desired video length
- Any existing brand guidelines or style preferences
Process:
- Read the complete script/outline
- Identify natural segment boundaries (topic shifts, tone changes)
- Determine primary engagement strategy based on content type
- Map emotional arc across the entire video
- Establish pacing rhythm (where to intensify, where to breathe)
Output:
- Project Overview document with strategy summary
- Segment list with high-level descriptions
- Emotional/energy graph showing intensity over time
Phase 2: Detailed Segment Specification
For each segment identified in Phase 1:
Step 1: Timing Allocation
- Assign precise start/end timestamps based on script reading speed
- Account for natural pauses and emphasis
- Build in breathing room between segments
Step 2: Visual Planning
- Define primary visual type (A-roll, B-roll, graphic)
- Create 3-5 alternative visual approaches
- Specify focal points to guide viewer attention
- Plan visual variety rhythm within segment
Step 3: Audio Layering
- Select music that matches segment emotion
- Identify all moments needing SFX
- Plan riser/hit/drone placement for emotional manipulation
- Sync audio peaks with visual moments
Step 4: Engagement Optimization
- Apply four-pillar framework to each moment
- Identify potential drop-off points and add hooks
- Balance stimulation density with comprehension needs
- Verify continuity with previous/next segments
Step 5: Technical Specification
- Define all effects, color grades, motion parameters
- Specify transition types and durations
- List overlay timing and animation styles
- Note any special technical requirements
Output: Complete segment specification using template format
Phase 3: Asset Requirements Compilation
Process:
- Extract all unique visual assets needed from segment specs
- Categorize by type (A-roll shots, B-roll footage, graphics, stock)
- Prioritize based on importance and difficulty to obtain
- Note alternatives if primary assets unavailable
Output:
- Organized asset list with descriptions
- Shot list for A-roll filming
- B-roll footage requirements (film vs. stock)
- Motion graphic briefs
- Stock footage search queries
Phase 4: Technical Specification Sheet
Process:
- Compile all software requirements
- Establish quality standards (resolution, bitrate, format)
- Document color grading approach
- List required plugins, fonts, sound libraries
- Define export settings
Output: Technical specification document covering:
- Software and versions
- Project settings
- Quality standards
- Required assets (sounds, fonts, plugins)
- Export specifications
Detailed Specification Guidelines
Visual Specification Standards
A-Roll Specifications Must Include:
- Camera angle and framing (wide/medium/close-up)
- Subject position in frame
- Lighting mood
- Background treatment (blur amount, color)
- Eye contact point for viewer focus continuity
- Duration and whether subject is static or moving
B-Roll Specifications Must Include:
- Exact content description
- Required motion (pan, tilt, static, handheld)
- Color treatment to match segment mood
- Specific visual details that matter to narrative
- Duration and if looped
- Relationship to voiceover (illustrative/metaphorical/explanatory)
Motion Graphic Specifications Must Include:
- Content type (diagram, text, chart, icon, illustration)
- Animation style (fade/slide/scale/perspective/custom)
- Entry and exit timing and method
- Color palette and relationship to brand
- Complexity level (simple icon vs. detailed diagram)
- Key information and visual hierarchy
Text Overlay Specifications Must Include:
- Exact text content
- Font, size, weight, color
- Animation in/out (with timing)
- Position on screen
- Duration on screen
- Emphasis techniques (color shift, scale, glow)
- Maximum 3 words at once for readability
Audio Specification Standards
Music Selection Criteria:
- Primary emotion (anticipation, excitement, mystery, comfort, intensity)
- Energy level (1-10 scale)
- Tempo range (BPM)
- Instrumentation character (electronic, organic, hybrid)
- Sync points with visual beats
- Volume levels for each segment
- Fade in/out timing (sudden, quick, gradual)
Sound Effect Categories:
-
Movement SFX – For every animated element
- Whoosh variations (light, medium, heavy)
- Swipe sounds for transitions
- Slide sounds for moving graphics
- Timing: Sync with visual motion start
-
Emphasis SFX – For important moments
- Pop sounds for appearing elements
- Hit sounds for impactful statements
- Click sounds for interface elements
- Timing: Exact frame alignment critical
-
Emotional SFX – For mood manipulation
- Risers (duration 0.5-3s) building into important moments
- Hits (sharp, immediate) releasing tension at reveals
- Drones (sustained, 5-30s) creating underlying atmosphere
- Timing: Strategic placement based on emotional arc
-
Ambient SFX – For environmental context
- Background atmosphere appropriate to B-roll
- Volume: Subtle, typically 10-30% of dialogue
- Duration: Continuous through related segments
Timing Specification Standards
Timestamp Format: MM:SS.ms (00:00.000)
Timing Precision Requirements:
- Segment boundaries: Frame-accurate (±0.033s)
- SFX placement: Frame-accurate
- Music sync points: Frame-accurate
- Visual cuts: Frame-accurate
- Text appearance: Within 0.1s
- Transitions: Exact duration specified
Reading Speed Standards:
- Casual speaking: 150-160 words per minute
- Enthusiastic/fast: 170-190 words per minute
- Measured/serious: 130-140 words per minute
- Calculate segment duration by word count ÷ speaking rate
Engagement Mechanics Specification
For each segment, explicitly define:
Attention Hook – First 0.5s element that captures focus
- Options: Sudden motion, contrasting color, unexpected sound, text emphasis, scale change
- Specify exact technique and timing
Viewer Emotion Target – Desired emotional state
- Categories: Curious, excited, informed, empowered, entertained, contemplative, energized
- Link to music and pacing choices
Primary Pillar – Which engagement pillar is emphasized
- Variety: When introducing new information or building energy
- Continuity: When explaining complex ideas requiring focus
- Audio: When manipulating emotion or building anticipation
- Pacing: When matching audience expectations for content type
Retention Strategy – Specific technique preventing drop-off
- Pattern interruption (sudden change after repetition)
- Curiosity gap (promising information to come)
- Emotional payoff (delivering satisfying conclusion)
- Momentum building (increasing energy toward next segment)
Reference Files
Audio Element Library Reference
For detailed sound effect selection guidance and categorization, see:
references/audio_elements_guide.md– Comprehensive SFX library organizationreferences/music_mood_matrix.md– Music selection framework by emotional target
Visual Treatment Library Reference
For specific effect implementation and parameter recommendations, see:
references/visual_effects_catalog.md– Complete effects library with use casesreferences/motion_graphics_templates.md– Standard animation patterns
Example Master Ledgers
For complete production ledger examples across different video types, see:
references/example_educational_ledger.md– Detailed educational content breakdownreferences/example_entertainment_ledger.md– High-energy entertainment specificationreferences/example_vlog_ledger.md– Authentic vlog minimal production approach
Quality Standards
Completeness Checklist
A production-ready master ledger must include:
- Complete timing for every segment (no gaps)
- Primary visual specified for every moment
- Minimum 3 alternatives per primary visual
- Audio layers defined for every segment
- At least one SFX per 5 seconds of content
- Music track specified with volume and sync points
- Focal point identified for every visual
- Engagement mechanics defined per segment
- All transitions specified with type and duration
- Technical specifications for all effects
- Complete asset requirements list
- Editor notes for complex or critical moments
Review Process
Before delivering master ledger:
- Timing Review – Verify no gaps, overlaps, or timing conflicts
- Continuity Review – Check focal point flow across cuts
- Audio Sync Review – Confirm music hits align with visual moments
- Alternative Viability – Ensure all alternatives are genuinely usable
- Technical Feasibility – Verify all specifications are achievable
- Engagement Density – Confirm attention maintenance throughout
- Brand Alignment – Check consistency with any provided guidelines
Output Format
Deliver master ledgers as structured markdown documents with:
- Project Overview Section at top
- Segment Specifications in chronological order
- Audio Architecture Map showing all music and SFX timing
- Visual Asset Requirements comprehensive list
- Technical Specifications complete software/export details
Use consistent formatting with clear hierarchy:
- H1 for document title
- H2 for major sections
- H3 for segment titles
- Clear visual dividers between segments
- Code blocks or tables for technical specifications
- Emphasis formatting for critical notes
Advanced Techniques
Sync Mapping for Music-Driven Segments
When segment energy should match music perfectly:
- Load music track and identify beat/measure boundaries
- Align segment start with strong beat or measure start
- Map visual hits (cuts, reveals, text) to music hits
- Place risers before major music builds
- Time transitions to finish on beat resolutions
Emotional Arc Continuity
Maintain emotional flow across segments:
- Map desired emotional intensity (1-10) for each segment
- Never jump more than 3 points between segments
- Use transition type to bridge emotional gaps:
- Small gap (1-2 points): Simple cut
- Medium gap (2-3 points): Quick transition with SFX
- Large gap (4+ points): Requires intermediary segment
Viewer Attention Heatmapping
Predict where viewer attention goes:
- Mark primary focal point for each visual
- Track eye position across cuts
- Flag any jump larger than 30% of screen width
- Add guidance elements (motion, highlight) when jumps occur
- Maintain focal continuity for important information
Cognitive Load Management
Balance information delivery with engagement:
- Rate each segment for cognitive demand (1-10)
- High-demand segments (7-10): Reduce visual variety, maintain continuity
- Low-demand segments (1-3): Increase variety, add stylistic elements
- Medium-demand segments (4-6): Balance based on audience sophistication
- Never exceed sustained cognitive load of 8 for more than 30 seconds
Common Patterns and Templates
Opening Hook Pattern (First 5-10 seconds)
SEGMENT 1: HOOK
Timing: 00:00.000 - 00:08.000
Visual: Fast-paced montage of compelling B-roll
- 5-7 quick cuts (1-2s each)
- High-energy visuals showing end result or transformation
- Each cut must contain movement or strong visual interest
Audio:
- Energetic music with strong beat
- No dialogue or minimal teaser dialogue
- Synchronized cutting to music beats
- Optional riser building toward title card
Purpose: Immediate engagement, sets energy level, promises value
Explanation Pattern (Teaching Complex Concept)
SEGMENT X: CONCEPT EXPLANATION
Timing: [Start] - [End]
Visual Primary: Motion graphic breaking down concept
- Start simple, build complexity progressively
- Use 3-5 second pauses at each build step
- Highlight new elements with color/animation
- Keep existing elements visible but de-emphasized
Visual Support: Picture-in-picture A-roll in corner
- Shows presenter explaining, builds connection
- Small enough to not distract from main graphic
- Can be removed during complex graphic moments
Audio:
- Music: Lower energy, more contemplative
- Volume: 30% during explanation, 50% during pauses
- SFX: Highlight sound for each new element introduced
- No risers/hits that might distract from learning
Purpose: Clear understanding, retention of complex information
Transition Between Topics Pattern
SEGMENT X: TOPIC TRANSITION
Timing: [Start] - [End] (typically 1-3 seconds)
Visual: Full-screen transition effect
- Type matches energy level and topic relationship
- Duration: 0.5-1.5s for related topics, 1-3s for major shifts
- Can include text card announcing new topic
Audio:
- Music: Fades out end of Topic A, fades in for Topic B
- Optional: Hit sound at transition point
- If major shift: Brief moment of silence (0.2-0.5s)
Purpose: Signal change, reset attention, prepare viewer for new context
Troubleshooting Common Issues
Issue: Too Much Visual Variety (Viewer Confusion)
Symptoms: Many cuts under 2 seconds, constant change without purpose, no focal point consistency
Solutions:
- Extend shots with inherent interest to 5-8 seconds
- Group related information with single visual
- Establish consistent focal point across 2-3 related cuts
- Reduce transition effects
Issue: Too Little Visual Variety (Viewer Boredom)
Symptoms: Long stretches without visual change, static shots over 10 seconds, repetitive visual patterns
Solutions:
- Add motion graphics overlay to static shots
- Implement slow push-in or pan on still shots
- Cut to related B-roll every 5-7 seconds
- Introduce text overlays for key points
Issue: Audio-Visual Desynchronization
Symptoms: Music hits not matching visual moments, SFX feeling detached, emotional tone mismatch
Solutions:
- Map music beat grid and align cuts to beats
- Place all SFX at exact frame of visual event
- Adjust music track timing or find better sync point
- Consider music edit (splice) if structural mismatch exists
Issue: Cognitive Overload
Symptoms: Too much happening simultaneously, competing focal points, insufficient processing time
Solutions:
- Reduce SFX density in information-heavy segments
- Extend duration of complex motion graphics
- Remove text overlays during B-roll storytelling
- Simplify color grading to reduce visual noise
Issue: Engagement Drop-off at Specific Points
Symptoms: Predictable viewer exit points, long segments without hooks, monotonous pacing
Solutions:
- Add pattern interruption (sudden change) every 8-10 seconds
- Increase music energy leading into weak segment
- Insert compelling B-roll or motion graphic
- Add riser before important information to build anticipation
Integration with Production Pipeline
Pre-Production Phase
Master ledger informs:
- Shot list creation for filming
- B-roll footage requirements
- Motion graphics briefs for designers
- Music licensing and selection
- SFX library acquisition
Production Phase
Master ledger guides:
- A-roll filming priorities and setup
- B-roll capture shopping list
- On-set timing considerations
- Contingency planning for missed shots
Post-Production Phase
Master ledger serves as:
- Editor’s primary reference document
- Timeline structure blueprint
- Audio mix specification
- QA checklist for completeness
- Revision communication tool
Best Practices
Do’s
â Specify alternatives: Always provide 3-5 visual options per segment â Time precisely: Use frame-accurate timestamps for all elements â Layer audio: Multiple audio elements create richness â Guide focus: Explicitly state where viewer eye should be â Connect emotion: Link all creative choices to target emotional state â Plan continuity: Consider flow across cuts, not just within shots â Justify choices: Explain “why” in editor notes when non-obvious â Build flexibility: Structure allows substitutions without breaking flow
Don’ts
â Don’t under-specify: “Add B-roll here” is insufficient – describe exactly what B-roll
â Don’t ignore timing: Vague durations lead to pacing problems
â Don’t forget audio: Visual-only specs miss half the engagement equation
â Don’t neglect alternatives: Single-option specs fail when assets unavailable
â Don’t break continuity: Jumping focal points destroy immersion
â Don’t over-complicate simple moments: Not every second needs maximum stimulation
â Don’t contradict pillars: Every choice should support engagement framework
â Don’t create in isolation: Each segment must connect to adjacent segments
Success Metrics
A high-quality master ledger enables:
- Editor efficiency: 50%+ faster editing with clear specifications
- First-cut accuracy: 80%+ of choices match creative intent
- Asset optimization: Zero missing assets, minimal unused footage
- Revision reduction: Clear specs reduce back-and-forth iterations
- Quality consistency: Systematic approach ensures professional results
- Scalability: Framework applies to videos of any length or type
Conclusion
The Video Production Master Ledger transforms the editing process from creative guesswork into systematic execution of a proven engagement framework. By specifying every visual, audio, timing, and technical element with precision and alternatives, the master ledger ensures editors can create compelling, attention-holding content that serves the video’s purpose while maintaining production efficiency.
The four-pillar framework underlying all specificationsâvisual variety, visual continuity, immersive audio, and healthy pacingâprovides a unified theory of viewer engagement that guides every creative decision from conceptual outline to final export.