google-image-creator

📁 horace4444/extend-my-claude-code 📅 10 days ago
1
总安装量
1
周安装量
#50224
全站排名
安装命令
npx skills add https://github.com/horace4444/extend-my-claude-code --skill google-image-creator

Agent 安装分布

openclaw 1
opencode 1
claude-code 1

Skill 文档

Google AI Image Generation

Generate images using Google’s Imagen and Gemini models with automatic cost tracking.

Quick Start

1. Present Model Options

When a user wants to generate images, first show them the top 3 options:

npx tsx .claude/skills/google-image-creator/scripts/list-models.ts

This displays:

  • Gemini 2.5 Flash Image ($0.039/image) – Best value, fastest
  • Imagen 4 Fast ($0.020/image) – Photorealistic quality
  • Imagen 4 Ultra ($0.060/image) – Highest quality

Let the user choose, or recommend Gemini 2.5 Flash for most use cases.

2. Generate Image

npx tsx .claude/skills/google-image-creator/scripts/generate-image.ts \
  "prompt here" \
  "model-id" \
  "./output.png"

Example:

npx tsx .claude/skills/google-image-creator/scripts/generate-image.ts \
  "sunset over mountains, photorealistic" \
  "gemini-2.5-flash-image" \
  "./sunset.png"

3. Report Cost

The script automatically reports:

  • Images generated
  • Model used
  • Estimated cost

Example output:

✅ Image generated successfully!
   Model: Gemini 2.5 Flash Image (gemini-2.5-flash-image)
   Prompt: "sunset over mountains"
   Saved to: ./sunset.png
   Images: 1
   Cost: $0.0390

Prerequisites

API Key Required:

User must have GOOGLE_API_KEY or GEMINI_API_KEY set:

export GOOGLE_API_KEY="user-api-key-here"

Get key at: https://aistudio.google.com/app/apikey

Note: All Google image generation is paid tier only (no free quota).

Workflow

Standard Workflow

  1. User Request: “Generate an image of X”

  2. Ask Model Preference: “Which model would you like to use?” Then run:

    npx tsx .claude/skills/google-image-creator/scripts/list-models.ts
    
  3. User Chooses: User selects model or you recommend based on needs

  4. Generate: Run generation script with chosen model

  5. Report: Display cost and save location

Quick Workflow (Skip Selection)

If user says “use the cheapest” or doesn’t specify:

  • Default to gemini-2.5-flash-image
  • Generate immediately
  • Report cost afterward

Model Selection Guide

Use Gemini 2.5 Flash when:

  • User wants lowest cost
  • High volume generation
  • Speed is priority
  • General-purpose images

Use Imagen 4 Fast when:

  • User needs photorealism
  • Balance of quality and cost
  • Simple text-to-image only

Use Imagen 4 Ultra when:

  • User explicitly requests highest quality
  • Client-facing deliverables
  • Budget allows premium pricing

Cost Tracking

Always report costs after generation. For multiple images, sum costs:

Image 1: $0.039
Image 2: $0.039
Image 3: $0.039
-----------------
Total: $0.117

Error Handling

Common Errors:

Error Solution
“API key not set” Tell user to set GOOGLE_API_KEY
“Unknown model” Run list-models.ts to show valid models
“API Error 401” API key invalid – user needs to verify key
“API Error 429” Rate limit – wait and retry
“Quota exceeded” Gemini image generation requires billing-enabled API key (free tier = 0 quota)
“Imagen API is only accessible to billed users” Enable billing on your Google Cloud project

Advanced: Multiple Images

For generating multiple images, run script multiple times and sum costs:

for i in {1..5}; do
  npx tsx .claude/skills/google-image-creator/scripts/generate-image.ts \
    "landscape $i" \
    "gemini-2.5-flash-image" \
    "./landscape_$i.png"
done

echo "Total: 5 images × $0.039 = $0.195"

Updating Model Information

To update pricing or add new models:

  1. Check official pricing: https://ai.google.dev/pricing
  2. Update references/models.md
  3. Update scripts/generate-image.ts MODEL_PRICING
  4. Update scripts/list-models.ts TOP_3_MODELS
  5. Repackage skill

Files to update:

  • references/models.md – Full documentation
  • references/api-guide.md – API patterns
  • scripts/generate-image.ts – MODEL_PRICING object
  • scripts/list-models.ts – TOP_3_MODELS array

References

Detailed Documentation:

Official Links:

Examples

Example 1: Basic Generation

User: “Generate a sunset image”

You: “I’ll generate a sunset image. Which model would you like?”

npx tsx .claude/skills/google-image-creator/scripts/list-models.ts

User: “Use the recommended one”

You:

npx tsx .claude/skills/google-image-creator/scripts/generate-image.ts \
  "beautiful sunset over ocean, warm colors, photorealistic" \
  "gemini-2.5-flash-image" \
  "./sunset.png"

“✅ Generated sunset.png using Gemini 2.5 Flash Image. Cost: $0.0390”


Example 2: High-Quality Request

User: “Generate a professional product photo, highest quality”

You: “For highest quality, I recommend Imagen 4 Ultra ($0.06/image). Proceed?”

User: “Yes”

You:

npx tsx .claude/skills/google-image-creator/scripts/generate-image.ts \
  "professional product photo of ceramic mug, studio lighting, high detail" \
  "imagen-4.0-ultra-generate-001" \
  "./product.png"

“✅ Generated product.png using Imagen 4 Ultra. Cost: $0.0600”


Example 3: Budget-Conscious

User: “Generate 10 landscape images, keep it cheap”

You: “I’ll use Gemini 2.5 Flash ($0.039/image). Total cost: $0.39 for 10 images.”

# Generate 10 images
# Sum costs

“✅ Generated 10 landscapes. Total cost: $0.3900”

Tips

Prompt Engineering:

  • Be specific: “sunset over mountains, warm colors, golden hour”
  • Mention style: “photorealistic”, “watercolor”, “3D render”
  • Include lighting: “soft natural light”, “studio lighting”
  • Specify details: “high resolution”, “shallow depth of field”

Cost Optimization:

  • Use Gemini 2.5 Flash for testing prompts
  • Only use Imagen 4 Ultra for final deliverables
  • Batch mode available for Gemini models (50% discount)
  • Iterate on cheaper models first

File Management:

  • Always specify output path
  • Use descriptive filenames
  • Organize by project/date
  • Keep generated images for reference