wan2122-i2v-comfyui
npx skills add https://smithery.ai
Agent 安装分布
Skill 文档
Wan2.1/2.2 I2V ComfyUI Implementation
Alibabaããªã¼ãã³ã½ã¼ã¹åããWan2.1/2.2 I2Vï¼Image-to-Videoï¼ã¢ãã«ãComfyUIã§ä½¿ç¨ããããã®å æ¬çã¬ã¤ããç»åããåç»ãçæããæ©è½ã®å®è£ ããµãã¼ãããã
Quick Start Checklist
Wan I2Vãå®è£ ããéã®å¿ é æé ï¼
-
ComfyUI ã»ããã¢ãã確èª
- ComfyUIææ°çãã¤ã³ã¹ãã¼ã«
git pullã§æ´æ°ã確èª
-
ã«ã¹ã¿ã ãã¼ãã®ã¤ã³ã¹ãã¼ã«ï¼ä½VRAMåãï¼
cd ComfyUI/custom_nodes git clone https://github.com/city96/ComfyUI-GGUF -
ã¢ãã«ãã¡ã¤ã«ã®é ç½®
- Diffusion Model â
models/diffusion_models/ - Text Encoder â
models/text_encoders/ - VAE â
models/vae/ - CLIP Vision â
models/clip_vision/
- Diffusion Model â
-
ã¯ã¼ã¯ããã¼ã®èªã¿è¾¼ã¿
- Menu â Workflow â Browse Templates â Video â Wan2.2
Model Overview
Wan2.1 vs Wan2.2 æ¯è¼
| ç¹æ§ | Wan2.1 | Wan2.2 |
|---|---|---|
| ãªãªã¼ã¹ | 2025å¹´2æ | 2025å¹´7æ |
| ã¢ãã«ãµã¤ãº | 14B, 1.3B | 14B, 5B |
| I2Vå¯¾å¿ | â | â |
| TI2Vå¯¾å¿ | – | âï¼5Bï¼ |
| æ¨å¥¨VRAM | 40GB+ (14B fp16) | 8GB+ (5Bãªããã¼ã) |
| ã©ã¤ã»ã³ã¹ | Apache 2.0 | Apache 2.0 |
ã¢ãã«é¸æã¬ã¤ã
é«å質éè¦ï¼ãã¤ã¨ã³ãGPUï¼:
- Wan2.2 14B fp16 – æé«å質
- VRAM: 40GB以䏿¨å¥¨
ãã©ã³ã¹éè¦ï¼ããã«ã¬ã³ã¸GPUï¼:
- Wan2.2 5B – å質ã¨VRAMã®ãã©ã³ã¹
- VRAM: 12-16GB
ä½VRAMç°å¢ï¼ã³ã³ã·ã¥ã¼ãã¼GPUï¼:
- Wan2.2 GGUF Q4_K_S – éååç
- VRAM: 8-10GB
Directory Structure
ã¢ãã«ãã¡ã¤ã«ã®é ç½®æ§é ï¼
ComfyUI/
âââ models/
â âââ diffusion_models/
â â âââ wan2.1_i2v_720p_14B_fp8_e4m3fn.safetensors
â â âââ wan2.2_i2v_14B_fp16.safetensors
â â âââ wan2.2_ti2v_5B_fp16.safetensors
â â âââ Wan2.2-I2V-A14B-Q4_K_S.gguf # ä½VRAMç¨
â âââ text_encoders/
â â âââ umt5_xxl_fp8_e4m3fn_scaled.safetensors
â âââ vae/
â â âââ wan2.1_vae.safetensors
â â âââ wan2.2_vae.safetensors
â âââ clip_vision/
â âââ clip_vision_h.safetensors
âââ custom_nodes/
âââ ComfyUI-GGUF/ # ä½VRAMç¨ã«ã¹ã¿ã ãã¼ã
Core Workflow Components
åºæ¬çãªI2Vã¯ã¼ã¯ããã¼æ§æ
[Load Image] â [CLIP Vision Encode] ââ
â
[Load Text Encoder] â [Text Encode] âââ¤
ââ [WanVideo Sampler] â [VAE Decode] â [Save Video]
[Load Diffusion Model] ââââââââââââââââ¤
â
[Load VAE] ââââââââââââââââââââââââââââ
å¿ é ãã¼ãä¸è¦§
| ãã¼ã | ç¨é | è¨å® |
|---|---|---|
| Load Diffusion Model | Wanã¢ãã«èªã¿è¾¼ã¿ | wan2.2_i2v_*.safetensors |
| Load CLIP | ããã¹ãã¨ã³ã³ã¼ãèªã¿è¾¼ã¿ | umt5_xxl_*.safetensors |
| Load VAE | VAEèªã¿è¾¼ã¿ | wan2.1_vae.safetensors |
| Load CLIP Vision | ç»åã¨ã³ã³ã¼ãèªã¿è¾¼ã¿ | clip_vision_h.safetensors |
| WanVideo Sampler | åç»çæãµã³ãã©ã¼ | CFG, Stepsçãè¨å® |
Key Parameters
è§£å度è¨å®
| è¨å® | è§£å度 | ç¨é |
|---|---|---|
| æ¨æºï¼ç¸¦é·ï¼ | 576Ã1024 | ãã¼ãã¬ã¼ãåç» |
| æ¨æºï¼æ¨ªé·ï¼ | 1024Ã576 | ã©ã³ãã¹ã±ã¼ãåç» |
| ä½VRAM | 840Ã480 | ã¡ã¢ãªç¯ç´ |
| é«å質 | 720p (1280Ã720) | 14Bã¢ãã«åã |
çæãã©ã¡ã¼ã¿
| ãã©ã¡ã¼ã¿ | æ¨å¥¨å¤ | 説æ |
|---|---|---|
| ãã¬ã¼ã æ° | 81 | ç´3-4ç§ @24fps |
| CFG | 4-7 | ä½ã=èªç¶ãªåããé«ã=ããã³ããå¿ å® |
| Steps | 20-30 | é«ã=ç´°é¨æ¹åãé度ã¯ä¸å®å®ã« |
| Seed | ä»»æ | åç¾æ§ã®ããåºå®æ¨å¥¨ |
CFG (Classifier-Free Guidance) ã¬ã¤ã
CFG 3.5-4.5: æå¤§éã®åãã¨å¤åãã¯ãªã¨ã¤ãã£ããªåºå
CFG 5.0-6.0: ãã©ã³ã¹ã®åããåãã¨ããã³ããå¿ å®åº¦
CFG 6.5-7.0: ããã³ããã«å¼·ãå¾ããåãã¯æ§ãã
GGUF Quantizationï¼ä½VRAMåãï¼
VRAM 8-12GBç°å¢ã§Wanã¢ãã«ãå®è¡ããããã®éååãªãã·ã§ã³ï¼
| éååã¬ãã« | ã¢ãã«ãµã¤ãº | VRAMç®å® | å質 |
|---|---|---|---|
| Q2_K | ~1.85GB | 6GB | ä½ |
| Q3_K_S | ~2.29GB | 8GB | ä¸ä½ |
| Q4_K_S | ~3.12GB | 10GB | ä¸ï¼æ¨å¥¨ï¼ |
| Q5_K_S | ~3.56GB | 12GB | é« |
GGUFã»ããã¢ãã
- ComfyUI-GGUFãã¤ã³ã¹ãã¼ã«
- éååã¢ãã«ããã¦ã³ãã¼ãï¼HuggingFaceããï¼
models/diffusion_models/ã«é ç½®Load GGUF Modelãã¼ãã使ç¨
Performance Benchmarks
çææéç®å®ï¼81ãã¬ã¼ã ã576Ã1024ï¼
| GPU | Wan2.2 5B | Wan2.2 14B GGUF Q4 |
|---|---|---|
| RTX 4090 | ~22-30ç§ | ~45-60ç§ |
| RTX 4070 | ~55-70ç§ | ~90-120ç§ |
| RTX 3080 | ~90-120ç§ | ~150-180ç§ |
Common Issues & Solutions
| åé¡ | åå | 解決ç |
|---|---|---|
| OOMï¼ã¡ã¢ãªä¸è¶³ï¼ | VRAMä¸è¶³ | è§£å度âãGGUF使ç¨ããã¬ã¼ã æ°â |
| æéç䏿´å | CFG/Stepsé大 | CFG 4-5ãSteps 20-25ã«èª¿æ´ |
| ã¢ãã«ãã¼ã失æ | ãã¹/ååä¸ä¸è´ | ãã£ã¬ã¯ããªæ§é ãç¢ºèª |
| çæãé ã | é«è§£å度/fp16 | GGUFéååãè§£ååº¦åæ¸ |
| åããå°ãªã | CFGé«ãã | CFG 3.5-4.5ã«ä¸ãã |
| å質ãä½ã | éååã¬ãã« | Q4_K_S以ä¸ãä½¿ç¨ |
詳細ãªãã©ãã«ã·ã¥ã¼ãã£ã³ã°ã¯ references/troubleshooting.md ãåç
§ã
Quick Reference
| Component | Purpose | Key API/File |
|---|---|---|
| Diffusion Model | åç»çæã®æ ¸ | wan2.2_i2v_*.safetensors |
| Text Encoder | ããã³ããå¦ç | umt5_xxl_*.safetensors |
| CLIP Vision | ç»åçè§£ | clip_vision_h.safetensors |
| VAE | ã¨ã³ã³ã¼ã/ãã³ã¼ã | wan2.1_vae.safetensors |
| ComfyUI-GGUF | ä½VRAMå¯¾å¿ | ã«ã¹ã¿ã ãã¼ã |
Additional Resources
Reference Files
è©³ç´°ãªæ å ±ã¯ä»¥ä¸ãåç §ï¼
references/model-specifications.md– ã¢ãã«ä»æ§ã®è©³ç´°ï¼Wan2.1/2.2ããã©ã¡ã¼ã¿æ°ãVRAMè¦ä»¶ï¼references/comfyui-setup.md– ComfyUIã»ããã¢ããå®å ¨ã¬ã¤ãreferences/workflow-components.md– ã¯ã¼ã¯ããã¼ã³ã³ãã¼ãã³ãã®è©³ç´°references/parameters-guide.md– ãã©ã¡ã¼ã¿è¨å®ã®è©³ç´°ã¬ã¤ãreferences/gguf-quantization.md– GGUFéååã®è©³ç´°ã¨è¨å®references/troubleshooting.md– ã¨ã©ã¼ä¸è¦§ã¨ãããã°ææ³
Example Files
å®è£
ãµã³ãã«ã¯ examples/ ãã£ã¬ã¯ããªãåç
§ï¼
examples/wan21-basic-i2v.json– Wan2.1åºæ¬I2Vã¯ã¼ã¯ããã¼examples/wan22-i2v-workflow.json– Wan2.2 I2Vã¯ã¼ã¯ããã¼examples/wan22-ti2v-workflow.json– Wan2.2 TI2Vï¼Text+Image to Videoï¼examples/low-vram-gguf.json– ä½VRAMåãGGUFã¯ã¼ã¯ããã¼examples/high-quality-14b.json– é«å質14Bè¨å®ã¯ã¼ã¯ããã¼