project-docs

📁 donghaozhang/video-agent-skill 📅 Feb 11, 2026
Total installs: 4
Weekly installs: 4
Site rank: #48018
Install command
npx skills add https://github.com/donghaozhang/video-agent-skill --skill project-docs

Installs by agent

gemini-cli 4
claude-code 4
openclaw 4
opencode 4
trae 3
replit 3

Skill documentation

AI Content Pipeline — Documentation Index

73 AI models across 12 categories. Click-based CLI (aicp). Central model registry. YAML pipelines with parallel execution.

Key Facts

  • Package: video-ai-studio on PyPI, commands aicp / ai-content-pipeline
  • Version: 1.0.24, Python 3.10+
  • Registry: packages/core/ai_content_pipeline/ai_content_pipeline/registry.py + registry_data.py
  • CLI: Click framework in cli/click_app.py, commands in cli/commands/ (6 modules, 19 commands + vimax subgroup)
  • Providers: FAL AI (primary, 30+ models), Google (Gemini/Veo), ElevenLabs (TTS), OpenRouter (prompts), Replicate
  • API keys: FAL_KEY, GEMINI_API_KEY, ELEVENLABS_API_KEY, OPENROUTER_API_KEY in .env
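As a quick sanity check before running anything, the key names listed above can be verified from Python. This is only a sketch: it assumes the values from .env have already been loaded into the process environment (by the shell or by a loader of your choice), and the helper name `missing_api_keys` is hypothetical, not part of the package. Which keys are actually needed depends on the providers you use.

```python
import os

# Key names are copied from the Key Facts list; the helper itself is illustrative.
KNOWN_KEYS = ("FAL_KEY", "GEMINI_API_KEY", "ELEVENLABS_API_KEY", "OPENROUTER_API_KEY")


def missing_api_keys(env=None):
    """Return the known key names that are unset or blank in the given mapping."""
    env = os.environ if env is None else env
    return [name for name in KNOWN_KEYS if not env.get(name, "").strip()]


missing = missing_api_keys()
if missing:
    print("Missing API keys:", ", ".join(missing))
else:
    print("All known provider keys are set.")
```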

Documentation Files

Load the specific file when Claude needs deeper information on that topic.

Models & Providers

| File | Load When |
|---|---|
| models.md | Selecting models, checking pricing, comparing options across 12 categories |
| provider-comparison.md | Choosing between FAL AI, Google, ElevenLabs, OpenRouter, Replicate |

CLI & API

| File | Load When |
|---|---|
| cli-commands.md | CLI usage, command flags, global options (--json, --quiet, --stream, --input) |
| python-api.md | Python API: AIPipelineManager methods, data classes, error handling |
| aicp-vimax-commands.md | ViMax subgroup: novel2movie, idea2video, script2video pipelines |

Architecture & Code

| File | Load When |
|---|---|
| architecture.md | System design, data flow diagrams, component responsibilities |
| package-structure.md | File locations, import paths, module dependencies |

Pipelines

| File | Load When |
|---|---|
| yaml-pipelines.md | YAML config syntax, 10 step types, variable interpolation, dependencies |
| parallel-execution.md | Parallel groups, performance optimization, 2-3x speedup patterns |
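The speedup from parallel groups comes from overlapping steps that do not depend on each other. The sketch below illustrates that general idea with Python's standard `concurrent.futures`; it is not the project's parallel engine, and `fake_step` is a hypothetical stand-in that sleeps in place of a remote API call.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def fake_step(name, seconds=0.2):
    """Stand-in for an I/O-bound pipeline step such as a remote API call."""
    time.sleep(seconds)
    return f"{name}: done"


def run_sequential(names):
    return [fake_step(n) for n in names]


def run_parallel(names):
    # pool.map preserves input order, so results line up with names.
    with ThreadPoolExecutor(max_workers=len(names)) as pool:
        return list(pool.map(fake_step, names))


steps = ["image_a", "image_b", "image_c"]

start = time.perf_counter()
seq = run_sequential(steps)
seq_time = time.perf_counter() - start

start = time.perf_counter()
par = run_parallel(steps)
par_time = time.perf_counter() - start

print(f"sequential {seq_time:.2f}s, parallel {par_time:.2f}s")
```

With three independent steps, the parallel run should take roughly the duration of one step. Real pipelines gain less when steps depend on each other or when a provider rate-limits concurrent requests.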

Content Creation

| File | Load When |
|---|---|
| prompting.md | Writing effective prompts, templates, model-specific tips |
| video-tips.md | Image-to-video vs text-to-video, motion prompts, model selection |
| video-analysis.md | AI video analysis with Gemini, timeline/describe/transcribe modes |

Optimization

| File | Load When |
|---|---|
| cost-management.md | Pricing tables, budget strategies, cost estimation CLI/API |
| performance.md | Speed benchmarks, batching, caching, network optimization |
| best-practices.md | Project organization, pipeline design patterns, QA workflows |

Troubleshooting

| File | Load When |
|---|---|
| error-codes.md | Error codes AUTH/CFG/MDL/PIP/NET/FILE/RATE/COST/VAL with solutions |
| troubleshooting.md | Diagnostic steps, common issues, installation/API/network problems |
| faq.md | Frequently asked questions across 9 categories |

Development

| File | Load When |
|---|---|
| testing.md | Test strategies, mocks, fixtures, CI/CD, running pytest |
| security.md | API key security, input validation, production deployment |
| contributing.md | Development workflow, coding standards, adding models/providers |
| migration.md | Version upgrades, breaking changes, rollback instructions |

Setup & Learning

| File | Load When |
|---|---|
| setup.md | Installation (PyPI/source), venv, API keys, first pipeline |
| learning-path.md | Structured learning tracks: Quick Start, Comprehensive, Developer |

Examples

| File | Load When |
|---|---|
| basic-examples.md | Simple image/video generation, CLI examples |
| advanced-pipelines.md | Multi-stage production, A/B testing, batch processing |
| use-cases.md | Marketing, education, enterprise, gaming applications |
| integrations.md | Flask, FastAPI, Celery, webhook patterns |

Model Categories Quick Reference

| Category | Count | Key Models | Cost Range |
|---|---|---|---|
| Text-to-Image | 8 | flux_dev, flux_schnell, imagen4, nano_banana_pro | $0.001-0.08 |
| Image-to-Image | 8 | photon, kontext, clarity, seededit | $0.015-0.05 |
| Text-to-Video | 10 | veo3, kling_3_pro, sora_2, hailuo_pro | $0.08-6.00 |
| Image-to-Video | 15 | veo_3_1_fast, kling_3_pro_i2v, sora_2_i2v | $0.08-3.60 |
| Video-to-Video | 4 | kling_o3_pro_edit, kling_o3_standard_edit | $0.25-0.34/s |
| Avatar | 10 | omnihuman_v1_5, fabric_1_0, multitalk | $0.06-0.25/s |
| Image Understanding | 7 | gemini_describe, gemini_detailed, gemini_qa | $0.001-0.002 |
| Prompt Generation | 5 | openrouter_video_prompt + style variants | $0.002 |
| Text-to-Speech | 3 | elevenlabs, elevenlabs_turbo, elevenlabs_v3 | $0.03-0.08 |
| Speech-to-Text | 1 | scribe_v2 | $0.008/min |
| Add Audio | 1 | thinksound | $0.001/s |
| Upscale Video | 1 | topaz | ~$1.50/video |
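For the categories priced per second, a rough budget is just the rate multiplied by clip length. The sketch below works that arithmetic using the ranges from the quick reference above. The function name and dictionary are hypothetical, and because the source does not say which model sits at which end of a range, the result is a (low, high) bracket rather than a quote for a specific model.

```python
# Per-second price ranges in USD, copied from the quick-reference table.
PER_SECOND_USD = {
    "video_to_video": (0.25, 0.34),
    "avatar": (0.06, 0.25),
    "add_audio": (0.001, 0.001),
}


def estimate_cost(category, seconds):
    """Return (low, high) USD estimates for a clip of the given length."""
    low, high = PER_SECOND_USD[category]
    # Round to 4 decimals to hide binary floating-point noise.
    return round(low * seconds, 4), round(high * seconds, 4)


print(estimate_cost("video_to_video", 10))  # (2.5, 3.4)
print(estimate_cost("avatar", 30))          # (1.8, 7.5)
```

For authoritative numbers, cost-management.md and the CLI's own cost estimation remain the reference.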

CLI Commands Quick Reference

# Core generation
aicp generate-image --text "prompt" --model flux_dev
aicp create-video --text "prompt" --video-model kling_3_pro
aicp run-chain --config pipeline.yaml [--parallel] [--stream] [--dry-run]

# Media operations
aicp generate-avatar --image-url URL --audio-url URL --model omnihuman_v1_5
aicp analyze-video -i video.mp4 [-t timeline|describe|transcribe]
aicp transcribe --input audio.mp3 [--srt] [--raw-json]
aicp transfer-motion --image-url URL --video-url URL
aicp upscale-image --image photo.png --upscale 2
aicp generate-grid --text "prompt" --layout 2x2

# Discovery
aicp list-models [--category X] [--provider X] [--json]
aicp list-avatar-models
aicp list-video-models
aicp list-motion-models
aicp list-speech-models

# Project
aicp setup
aicp init-project
aicp organize-project
aicp structure-info
aicp create-examples

# ViMax (novel-to-video)
aicp vimax idea2video --idea "concept"
aicp vimax novel2movie --novel novel.txt [--storyboard-only]
aicp vimax script2video --script story.txt
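When the CLI is driven from a script, building the argument list explicitly avoids shell-quoting problems with prompts. The sketch below uses only the `generate-image` flags shown above (`--text`, `--model`); the helper names are hypothetical, and nothing here calls the package's Python API. Note that actually running the command may incur provider charges.

```python
import shlex
import shutil
import subprocess


def build_generate_image_cmd(prompt, model="flux_dev"):
    """Build the argv list for `aicp generate-image` from the flags listed above."""
    return ["aicp", "generate-image", "--text", prompt, "--model", model]


def run_if_installed(argv):
    """Run the command only when its executable is on PATH; otherwise return None."""
    if shutil.which(argv[0]) is None:
        return None
    return subprocess.run(argv, capture_output=True, text=True, check=False)


cmd = build_generate_image_cmd("a red fox in snow")
# Show the shell-safe form of the command without executing it.
print(shlex.join(cmd))
```

Passing the list to `run_if_installed(cmd)` would execute it and return a `CompletedProcess` whose `returncode`, `stdout`, and `stderr` can be inspected.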

Architecture Summary

CLI (Click) → Pipeline Manager → Providers → External APIs
                    ↓                            ↑
              Config (YAML)              FAL / Google / ElevenLabs / OpenRouter
                    ↓
              Executor → Parallel Engine → Results → Output Files
  • Central Registry: registry.py defines ModelDefinition + ModelRegistry, registry_data.py registers all 73 models
  • Auto-Discovery: Generator classes use MODEL_KEY class attributes
  • CLI: Root Click group in cli/click_app.py, commands auto-registered at import time
  • Pipeline: manager.py orchestrates, executor.py runs steps, parallel.py handles concurrency
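The central-registry idea can be pictured with a small stand-alone sketch. The class names `ModelDefinition` and `ModelRegistry` and the `MODEL_KEY` attribute come from the summary above, but every field, method, category string, and provider value below is an assumption made for illustration and may not match the real registry.py.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelDefinition:
    # Field names are illustrative, not the project's actual schema.
    key: str
    category: str
    provider: str


class ModelRegistry:
    """Minimal in-memory registry keyed by model key."""

    def __init__(self):
        self._models = {}

    def register(self, model):
        if model.key in self._models:
            raise ValueError(f"duplicate model key: {model.key}")
        self._models[model.key] = model

    def get(self, key):
        return self._models[key]

    def list_by_category(self, category):
        return [m.key for m in self._models.values() if m.category == category]


registry = ModelRegistry()
# Provider and category values are placeholders for the example.
registry.register(ModelDefinition("flux_dev", "text_to_image", "fal"))
registry.register(ModelDefinition("flux_schnell", "text_to_image", "fal"))
registry.register(ModelDefinition("veo3", "text_to_video", "google"))


class FluxDevGenerator:
    # A generator points at its registry entry through MODEL_KEY.
    MODEL_KEY = "flux_dev"

    def definition(self):
        return registry.get(self.MODEL_KEY)


print(registry.list_by_category("text_to_image"))
print(FluxDevGenerator().definition().provider)
```

Keeping model metadata in one place means a generator class only needs to carry its key, and listing or validating models does not require importing every generator.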

Testing Quick Reference

python -m pytest tests/ -v                    # Full suite (~844 tests)
python -m pytest tests/test_registry.py -v    # Registry tests
python -m pytest tests/test_click_app.py -v   # CLI tests
python scripts/validate_registry.py           # Registry validation