picture
9
总安装量
9
周安装量
#33018
全站排名
安装命令
npx skills add https://github.com/spm1001/claude-suite --skill picture
Agent 安装分布
github-copilot
9
codex
9
kimi-cli
9
gemini-cli
9
cursor
9
amp
9
Skill 文档
Image Generation
Generate AI images using Google Imagen via the Gemini API.
When to Use
- Presentation images (hero visuals, section dividers)
- Conceptual illustrations (visual metaphors, abstract concepts)
- Photo-realistic images (product mockups, scenarios)
- Visual explanations that benefit from AI generation
When NOT to Use
- Precise diagrams or charts â use
diagrammingskill (editable SVG, exact data) - Screenshots â use
screenshotskill - Simple icons â often faster to find stock or use emoji
Overlap with diagramming: There’s fuzzy boundary. A “chart for a presentation” could go either way:
- Need precise data, editability â diagramming
- Need striking visual punch â image-generation
- Use judgement; ask if unclear.
Workflow
1. Understand the Need
Clarify with user:
- Purpose â presentation, concept illustration, visual metaphor?
- Style â photorealistic, illustration, abstract?
- Brand â does it need ITV styling? (if so, read itv-styling skill)
2. Draft with Flash
Use the fast model for initial iterations:
~/.claude/skills/picture/imagen.sh "prompt" --model gemini-2.5-flash-image
3. Review and Refine
Open the image, assess, iterate:
# Edit mode: refine previous output
~/.claude/skills/picture/imagen.sh "make it warmer, add more contrast" --input ./images/previous.png
4. Final Render with Pro
For client-facing or final deliverables:
~/.claude/skills/picture/imagen.sh "prompt" --model gemini-3-pro-image-preview
Command Reference
# Basic generation
imagen.sh "prompt" [--output ./images] [--model MODEL]
# Edit existing image
imagen.sh "refinement prompt" --input previous.png
# Models
--model gemini-2.5-flash-image # Fast, cheap (default)
--model gemini-3-pro-image-preview # Higher quality
--model imagen-4.0-generate-preview-06-06 # Imagen 4
Output: Saves to ./images/ with timestamped filename, prints path.
Prompting Framework
Based on Max Woolf’s Nano Banana research.
Structure
[Specific object description with exact requirements in CAPS]
Aspects that MUST be followed EXACTLY:
- [Compositional rule 1]
- [Compositional rule 2]
[Publication/camera details for style elevation]
Do not include [unwanted elements].
Key Techniques
| Technique | Example |
|---|---|
| Structured bullets | Requirements as dashed list, not prose |
| ALL CAPS constraints | “MUST”, “EXACTLY” increases adherence |
| Hex colors | #9F2B68 more precise than “magenta” |
| Composition rules | “rule of thirds”, “negative space”, “depth of field” |
| Style elevators | “Pulitzer Prize-winning cover photo for NYT” |
| Camera specs | “Canon EOS 90D DSLR camera” |
| Publication targets | “Vanity Fair cover profile” |
| Negative constraints | “Do not include text, watermarks, or line overlays” |
Example Prompt
A professional headshot of a confident business executive.
Aspects that MUST be followed EXACTLY:
- Shot from shoulders up, rule of thirds composition
- Neutral background with soft gradient #E8E8E8 to #FFFFFF
- Natural 3PM diffuse lighting from left
- Sharp focus on eyes, slight bokeh on background
Pulitzer Prize-winning portrait, Canon EOS R5, 85mm f/1.4.
Do not include any text, logos, or watermarks.
Composing with Brand Skills
With itv-styling
When creating ITV-branded images:
- Read
itv-stylingfor color palette and principles - Bake brand constraints into prompt:
Corporate presentation image for ITV.
Aspects that MUST be followed EXACTLY:
- Dark background #0F2323 (ITV dark green)
- Accent elements in #E8E557 (ITV yellow) or #4ECDC4 (ITV teal)
- Clean, modern, professional aesthetic
- No busy patterns or off-brand colors
Professional corporate photography style.
With diagramming
For hybrid needs (visual + precise data):
- Generate AI background/illustration with image-generation
- Overlay precise elements with diagramming
- Composite manually if needed
Limitations
| Limitation | Workaround |
|---|---|
| Style transfer fails (“Studio Ghibli style”) | Use structural descriptions instead |
| Text generation imperfect | Add text as overlay after generation |
| Exact positioning difficult | Iterate with refinement prompts |
| Rate limits | Use Flash for drafts, Pro only for finals |
Output Location
Images save to ./images/ in the project directory:
- Created on first use
- Timestamped filenames for uniqueness
- Stays with project for easy reference
Anti-Patterns
| Pattern | Problem | Fix |
|---|---|---|
| Skip brand check | Inconsistent styling | Load itv-styling first when brand applies |
| Vague prompts | Poor results | Use specific, concrete descriptions |
| Wrong tool for data | Inaccurate charts | Use diagram skill for precise data |
See Also
references/prompting.mdâ Extended prompting referencediagrammingskill â For precise diagrams and chartsitv-stylingskill â For brand-constrained outputs