audiobook-production
npx skills add https://github.com/guia-matthieu/clawfu-skills --skill audiobook-production
Agent 安装分布
Skill 文档
Audiobook Production
Master ACX-compliant audiobook production using Audible’s technical standards and Richard Mayer’s multimedia learning principles for engaging, professional audio.
When to Use This Skill
- Producing an audiobook for Audible/ACX distribution
- Creating audio versions of courses or written content
- Setting up narration workflow for long-form content
- Ensuring audio meets professional distribution standards
- Self-producing an audiobook as an author
- Managing audiobook production projects
Methodology Foundation
Source: ACX (Audiobook Creation Exchange) + Richard Mayer (Multimedia Learning)
Core Principle: Professional audiobook production balances technical excellence (meeting strict ACX standards) with engaging delivery (applying learning science principles). As Mayer’s research shows: “People learn better from friendly human voice than machine voice” (Voice Principle).
Why This Matters: ACX/Audible rejects audiobooks that don’t meet technical specifications, wasting production time and money. Beyond technical compliance, understanding how listeners process audio enables production choices that maximize engagement and retention.
What Claude Does vs What You Decide
| Claude Does | You Decide |
|---|---|
| Structures production workflow | Final creative direction |
| Suggests technical approaches | Equipment and tool choices |
| Creates templates and checklists | Quality standards |
| Identifies best practices | Brand/voice decisions |
| Generates script outlines | Final script approval |
What This Skill Does
- Ensures ACX technical compliance – Specs for bit rate, sample rate, RMS, peaks, and noise floor
- Structures audiobook projects – Chapter organization, credits, retail samples
- Applies learning principles – Pacing, tone, and delivery that aid comprehension
- Manages production workflow – From script to final master
- Handles proofing and QC – Catching errors before submission
How to Use
Check ACX Compliance
Help me verify this audiobook meets ACX submission requirements.
[provide audio specs or describe setup]
Plan Audiobook Structure
Help me structure an audiobook from this manuscript:
Total word count: [X]
Chapters: [Y]
Type: [Fiction/Non-fiction]
Create Production Workflow
Create a production plan for my audiobook project:
Book length: [word count]
Narrator: [self/hired]
Deadline: [date]
Instructions
When producing audiobooks, follow this methodology:
Step 1: Understand ACX Technical Requirements
ACX has strict specifications that must be met for acceptance.
## ACX Audio Specifications (2026)
### File Format
- MP3, 192 kbps or higher (constant bit rate)
- 44.1 kHz sample rate
- Mono (single channel)
### Audio Levels
- RMS: -23dB to -18dB
- Peak values: -3dB maximum (no clipping)
- Noise floor: -60dB or lower
### File Naming
- Chapter files: [BookTitle]_[ChapterNumber].mp3
- Opening credits: [BookTitle]_Opening_Credits.mp3
- Closing credits: [BookTitle]_Closing_Credits.mp3
### Content Requirements
- Opening credits (first file): Title, author, narrator
- Closing credits (last file): "This has been [title] by [author], narrated by [narrator]"
- Each chapter file is standalone (no cross-file audio)
- Consistent room tone throughout
### Quality Requirements
- No extraneous sounds (mouth clicks, pops, page turns)
- Consistent audio quality across all files
- Retail sample: ~5 minutes of representative content
Critical: Run ACX Check tool on every file before submission.
Step 2: Prepare the Manuscript
Before recording, optimize the text for audio delivery.
## Manuscript Preparation Checklist
### Structure
â¡ Chapter breaks clearly marked
â¡ Section breaks identified (longer pause points)
â¡ Footnotes converted to in-text explanations or removed
â¡ Visual content (charts, tables) adapted for audio or noted for skip
### Pronunciation Guide
â¡ Character names with phonetic spelling
â¡ Foreign words and phrases
â¡ Technical terms and jargon
â¡ Place names and proper nouns
â¡ Acronyms expanded or spelled out
### Narrator Notes
â¡ Tone shifts marked
â¡ Character voice cues (if fiction)
â¡ Emphasis suggestions (minimal)
â¡ "As shown in Figure X" â adapted for audio
â¡ Author asides or commentary identified
### Timing Estimate
â¡ Word count per chapter
â¡ Total word count
⡠Estimated runtime: words ÷ 9,300 = hours (approx)
- 50,000 words â 5-6 hours
- 80,000 words â 8-9 hours
Step 3: Set Up Recording Environment
Consistent, quality recording requires proper setup.
## Home Studio Essentials
### Minimum Equipment
- **Microphone**: USB condenser (AT2020, Blue Yeti) or XLR setup
- **Pop filter**: Essential for plosives
- **Mic stand**: Stable positioning
- **Headphones**: Closed-back, wired
### Room Treatment
- Record in smallest, most furnished room available
- Hang blankets on hard surfaces
- Use reflection filter behind mic
- Close windows, turn off HVAC
- Eliminate electronic hum (fridges, computers)
### Recording Software (DAW)
- Free: Audacity, GarageBand
- Professional: Adobe Audition, Reaper, Pro Tools
- ACX-specific: Hindenburg Pro (audiobook workflow)
### Test Recording Checklist
â¡ Record 30 seconds of room tone
â¡ Record test passage
â¡ Check RMS levels (-23 to -18 dB)
â¡ Check peak levels (below -3 dB)
â¡ Check noise floor (below -60 dB)
â¡ Listen for room echo or hum
â¡ Verify consistent mic positioning
Step 4: Apply Mayer’s Multimedia Learning Principles
Science-backed techniques for engaging audio.
## Mayer's Principles for Audiobook Delivery
### Voice Principle
"People learn better from human voice than machine voice."
â Warm, conversational delivery beats polished broadcast voice
â Let personality come through
â Avoid monotone "reading" voice
### Personalization Principle
"People learn better when words are conversational rather than formal."
â Write/deliver as if talking to one person
â Use "you" and "we"
â Contractions are encouraged
### Segmenting Principle
"People learn better when content is in learner-paced segments."
â Clear chapter breaks
â Section pauses for complex material
â Chapter summaries for dense non-fiction
### Redundancy Principle (for video/slides)
"People learn better from graphics and narration than graphics + narration + text."
â For audiobooks: don't describe visual elements you can't show
â Adapt visual content or acknowledge it's omitted
### Coherence Principle
"People learn better when extraneous material is excluded."
â Cut verbal filler
â Remove tangents that don't serve the chapter
â Every word earns its place
Step 5: Production Workflow
Systematic process from script to master.
## Audiobook Production Pipeline
### Phase 1: Pre-Production (Week 1-2)
â¡ Manuscript finalized and proofread
â¡ Pronunciation guide complete
â¡ Chapter structure confirmed
â¡ Recording schedule created
â¡ Test recordings approved
### Phase 2: Recording (Weeks 2-6, varies by length)
â¡ Opening credits recorded first
â¡ Chapters recorded in order (usually)
â¡ Room tone captured each session
â¡ Daily backup of raw files
â¡ Notes logged for pickups needed
### Phase 3: Editing (Concurrent or following)
â¡ Remove mistakes and restart takes
â¡ Remove excessive breaths (keep natural ones)
â¡ Eliminate mouth clicks and pops
â¡ Smooth edits (crossfade)
â¡ Consistent pacing throughout
### Phase 4: Mastering
â¡ Noise reduction applied (light touch)
â¡ Compression for consistent levels
â¡ Limiting to prevent peaks
â¡ Final RMS check (-23 to -18 dB)
â¡ Export to ACX specs (192 kbps MP3, 44.1kHz, mono)
### Phase 5: Quality Control
â¡ Listen to every file start to finish
â¡ Check first/last 5 seconds of each file
â¡ Verify file naming convention
â¡ Run ACX Check or similar tool
â¡ Create retail sample (5 min)
Step 6: Credits and Sample
Specific requirements for opening, closing, and retail sample.
## Opening Credits Script
(Must be first audio file)
"[Book Title]"
Written by [Author Name]
Narrated by [Narrator Name]
[Optional: Publisher name]
[Optional: Copyright year]
---
## Closing Credits Script
(Must be last audio file)
"This has been [Book Title], written by [Author Name],
narrated by [Narrator Name]."
[Optional: "Visit [website] for more" or other approved callout]
[Optional: "Audiobook production by [producer]"]
---
## Retail Sample Guidelines
**Length**: Approximately 5 minutes
**Content**: Representative excerpt that hooks listeners
**Selection criteria**:
- Showcases narrator's ability
- Engages interest without major spoilers
- Demonstrates the book's voice/style
- Often from early chapters (context not required)
**Do NOT include**: Credits or chapter announcements in sample
Examples
Example 1: Self-Published Non-Fiction Audiobook
Context: An author is self-narrating their business book.
Input:
Help me plan audiobook production for my 55,000-word business book. I’m self-narrating. I have a Blue Yeti mic and record in my home office.
Output:
Production Plan: Business Book Audiobook
Project Specifications
- Word count: 55,000 words
- Estimated runtime: 5.5-6 hours
- Narrator: Author (self-narrated)
- Equipment: Blue Yeti USB mic (adequate for ACX)
Pre-Production (Days 1-3)
Manuscript Prep
- Review all 12 chapters for audio adaptation
- Convert/adapt 8 figures referenced in text
- Create pronunciation guide for industry terms
- Mark chapter breaks and section pauses
- Estimate 4,500 words/chapter average = ~29 min/chapter
Room Preparation
- Assess home office acoustics
- Add blanket on wall behind recording position
- Use closet with clothes as recording booth alternative
- Test recordings at 3 different times of day (HVAC, traffic noise)
Sample Script: “As shown in Figure 3” â “Here’s how it works: imagine a funnel with three stages…”
Recording Schedule (Weeks 1-3)
Daily recording block: 2 hours max (voice fatigue) Target per session: 1.5-2 chapters (8,000 words) Total sessions needed: ~8-10
Schedule:
- Day 1: Opening credits + Chapter 1
- Day 2: Chapters 2-3
- Day 3: Chapters 4-5
- [continue pattern]
- Day 8: Chapter 12 + Closing credits
- Days 9-10: Pickups and problem sections
Session Ritual:
- Hydrate 30 min before (room temp water)
- Vocal warmup (5 min)
- Record room tone (10 sec)
- Test recording, check levels
- Record chapter(s)
- Log pickup notes
Post-Production (Week 4)
Editing workflow per chapter:
- Import raw audio
- Remove mistakes and alternate takes
- Clean up breaths and mouth clicks
- Apply light noise reduction
- Normalize RMS to -20 dB
- Check peaks (below -3 dB)
- Export as MP3 (192 kbps, 44.1 kHz, mono)
File naming:
- BusinessBookTitle_Opening_Credits.mp3
- BusinessBookTitle_001.mp3
- BusinessBookTitle_002.mp3
- …
- BusinessBookTitle_012.mp3
- BusinessBookTitle_Closing_Credits.mp3
Quality Control (Days 1-2)
- Listen through every file with checklist
- Run ACX Check on all files
- Create 5-minute retail sample from Chapter 2
- Verify consistent sound across all chapters
- Final review of opening/closing credits
Technical Settings
Recording:
- 44.1 kHz, 24-bit WAV (edit in high quality)
- Mono input
- Gain set for peaks at -12 dB
Export (ACX compliant):
- MP3, 192 kbps CBR
- 44.1 kHz
- Mono
- RMS: -20 dB (within -23 to -18)
- Peak: -3 dB max
Example 2: ACX Troubleshooting
Context: Audiobook files are being rejected by ACX.
Input:
My audiobook submission was rejected. ACX says “RMS levels out of spec” and “noise floor too high.” How do I fix this?
Output:
ACX Rejection Fix: RMS and Noise Floor
Diagnosis
RMS out of spec: Your audio is either too quiet (below -23 dB) or too loud (above -18 dB)
Noise floor too high: Background hiss or hum is above -60 dB
Step-by-Step Fix
Fix 1: RMS Levels
In Audacity:
- Select all audio (Ctrl+A)
- Effect â Loudness Normalization
- Set to: -20 dB RMS (middle of ACX range)
- Apply
In Adobe Audition:
- Select all
- Effects â Amplitude â Match Loudness
- Set Target Loudness to -20 LUFS
- Process
Fix 2: Noise Floor
If noise is mild (just room tone):
- Find 2-3 seconds of “silence” (room tone only)
- Select it as noise profile
- Apply noise reduction to full track
- Use minimal settings (6-12 dB reduction) to avoid artifacts
In Audacity:
- Effect â Noise Reduction â Get Noise Profile
- Select all â Noise Reduction â 6-12 dB, 6 Sensitivity
If noise is significant:
- Re-record in quieter environment
- Use noise gate (removes sound below threshold)
- Consider iZotope RX for professional repair
Fix 3: Verify Peaks
After RMS adjustment, peaks may exceed -3 dB:
- Effect â Limiter
- Set ceiling to -3.0 dB
- Apply
Verification Workflow
- Make fixes to one chapter first
- Run ACX Check plugin or online tool
- Verify all specs pass:
- RMS: -23 to -18 dB â
- Peak: -3 dB or lower â
- Noise floor: -60 dB or lower â
- Apply same process to all files
- Re-check each file before resubmitting
Prevention for Future
Recording checklist:
- Noise floor below -60 dB before recording
- Gain set so peaks hit -12 dB when speaking
- Room tone recorded each session
- Test file checked before full session
Checklists & Templates
ACX Pre-Submission Checklist
## Technical Compliance
â¡ All files are 192 kbps MP3, 44.1 kHz, mono
â¡ RMS levels between -23 and -18 dB
â¡ Peak values at or below -3 dB
â¡ Noise floor at or below -60 dB
â¡ No clipping or distortion
â¡ Consistent levels across all chapters
## File Structure
â¡ Opening credits file present
â¡ Closing credits file present
â¡ Files named correctly: [Title]_[Number].mp3
â¡ Chapter numbers are sequential
â¡ No missing chapters
â¡ Retail sample created (â5 minutes)
## Content Quality
â¡ First/last 5 seconds of each file are clean (room tone or fade)
â¡ No extraneous sounds (clicks, pops, breaths)
â¡ No room echo or reverb issues
â¡ Consistent voice quality throughout
â¡ All pickups and corrections completed
â¡ Credits read correctly (title, author, narrator)
## Final Review
â¡ Listened to complete audiobook
â¡ Spot-checked transitions between chapters
â¡ Verified retail sample is representative
â¡ Cover art meets ACX specs (if applicable)
â¡ All metadata entered correctly
Production Schedule Template
## Audiobook Production Schedule
**Title**: ________________________________
**Word count**: ___________ | **Est. runtime**: ___________
**Narrator**: _____________________________
**Target completion**: ____________________
### Pre-Production: Week ___
| Task | Due | Done |
|------|-----|------|
| Manuscript finalized | | â¡ |
| Pronunciation guide | | â¡ |
| Room/equipment tested | | â¡ |
| Opening credits scripted | | â¡ |
### Recording: Weeks ___ - ___
| Session | Chapters | Words | Est. Time | Done |
|---------|----------|-------|-----------|------|
| 1 | Credits + Ch 1 | | | â¡ |
| 2 | Ch 2-3 | | | â¡ |
| 3 | Ch 4-5 | | | â¡ |
| ... | | | | â¡ |
### Post-Production: Week ___
| Task | Due | Done |
|------|-----|------|
| Editing complete | | â¡ |
| Mastering complete | | â¡ |
| ACX specs verified | | â¡ |
| QC listen-through | | â¡ |
| Retail sample created | | â¡ |
### Submission: ___________
Skill Boundaries
What This Skill Does Well
- Structuring audio production workflows
- Providing technical guidance
- Creating quality checklists
- Suggesting creative approaches
What This Skill Cannot Do
- Replace audio engineering expertise
- Make subjective creative decisions
- Access or edit audio files directly
- Guarantee commercial success
References
- ACX. “Audio Submission Requirements” – Official technical specifications
- Richard Mayer. “Multimedia Learning” (2009) – Scientific principles for audio learning
- iZotope. “Professional Voice Over Recording Tips” – Technical recording guidance
- Audacity Team. “Audacity Manual” – Free editing software
Related Skills
- voiceover-direction – Directing talent for quality delivery
- audio-editing – Post-production fundamentals
- voice-design – AI voice alternatives for narration
- transcription-to-content – Repurposing audiobook content
Skill Metadata (Internal Use)
name: audiobook-production
category: audio
subcategory: voiceover
version: 1.0
author: MKTG Skills
source_expert: ACX, Richard Mayer
source_work: ACX Submission Requirements, Multimedia Learning
difficulty: intermediate
estimated_value: $500-2,000 per audiobook (equivalent production management)
tags: [audiobook, acx, audible, narration, production]
created: 2026-01-26
updated: 2026-01-26