baoyu-image-gen

📁 stonehah/qunz-skills 📅 2 days ago

总安装量

周安装量

#48460

全站排名

安装命令

npx skills add https://github.com/stonehah/qunz-skills --skill baoyu-image-gen

Agent 安装分布

gemini-cli 4

github-copilot 4

codex 4

kimi-cli 4

amp 4

cursor 4

Skill 文档

Image Generation (AI SDK)

Official API-based image generation. Supports OpenAI, Google, DashScope (é¿ééä¹ä¸è±¡), Replicate and APIMart providers.

Script Directory

Agent Execution:

SKILL_DIR = this SKILL.md file’s directory
Script path = ${SKILL_DIR}/scripts/main.ts

Preferences (EXTEND.md)

Use Bash to check EXTEND.md existence (priority order):

# Check project-level first
test -f .baoyu-skills/baoyu-image-gen/EXTEND.md && echo "project"

# Then user-level (cross-platform: $HOME works on macOS/Linux/WSL)
test -f "$HOME/.baoyu-skills/baoyu-image-gen/EXTEND.md" && echo "user"

ââââââââââââââââââââââââââââââââââââââââââââââââââââ¬ââââââââââââââââââââ â Path â Location â ââââââââââââââââââââââââââââââââââââââââââââââââââââ¼ââââââââââââââââââââ¤ â .baoyu-skills/baoyu-image-gen/EXTEND.md â Project directory â ââââââââââââââââââââââââââââââââââââââââââââââââââââ¼ââââââââââââââââââââ¤ â $HOME/.baoyu-skills/baoyu-image-gen/EXTEND.md â User home â ââââââââââââââââââââââââââââââââââââââââââââââââââââ´ââââââââââââââââââââ

âââââââââââââ¬ââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââ â Result â Action â âââââââââââââ¼ââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââ¤ â Found â Read, parse, apply settings â âââââââââââââ¼ââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââ¤ â Not found â Use defaults â âââââââââââââ´ââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââ

EXTEND.md Supports: Default provider | Default quality | Default aspect ratio | Default image size | Default models

Schema: references/config/preferences-schema.md

Usage

# Basic
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image cat.png

# With aspect ratio
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A landscape" --image out.png --ar 16:9

# High quality
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --quality 2k

# From prompt files
npx -y bun ${SKILL_DIR}/scripts/main.ts --promptfiles system.md content.md --image out.png

# With reference images (Google multimodal or OpenAI edits)
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "Make blue" --image out.png --ref source.png

# With reference images (explicit provider/model)
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "Make blue" --image out.png --provider google --model gemini-3-pro-image-preview --ref source.png

# Specific provider
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --provider openai

# DashScope (é¿ééä¹ä¸è±¡)
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "ä¸åªå¯ç±çç«" --image out.png --provider dashscope

# Replicate (google/nano-banana-pro)
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --provider replicate

# Replicate with specific model
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --provider replicate --model google/nano-banana

# APIMart (Gemini-3-Pro-Image-preview, async task polling)
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat in watercolor style" --image out.png --provider apimart

# APIMart with reference image
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "Make this into anime style" --image out.png --provider apimart --ref source.png

# APIMart with doubao-seedream-5-0-lite model
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cinematic mountain sunrise" --image out.png --provider apimart --model doubao-seedream-5-0-lite

Options

Option	Description
`--prompt <text>`, `-p`	Prompt text
`--promptfiles <files...>`	Read prompt from files (concatenated)
`--image <path>`	Output image path (required)
`--provider google\|openai\|dashscope\|replicate\|apimart`	Force provider (default: auto-detect, usually google if available)
`--model <id>`, `-m`	Model ID (`--ref` with OpenAI requires GPT Image model, e.g. `gpt-image-1.5`; APIMart example: `doubao-seedream-5-0-lite`)
`--ar <ratio>`	Aspect ratio (e.g., `16:9`, `1:1`, `4:3`)
`--size <WxH>`	Size (e.g., `1024x1024`)
`--quality normal\|2k`	Quality preset (default: 2k)
`--imageSize 1K\|2K\|4K`	Image size hint for Google/APIMart (APIMart maps to `resolution`)
`--ref <files...>`	Reference images. Supported by Google multimodal, OpenAI edits (GPT Image models), Replicate and APIMart
`--n <count>`	Number of images (APIMart currently supports only `1`)
`--json`	JSON output

Environment Variables

Variable	Description
`OPENAI_API_KEY`	OpenAI API key
`GOOGLE_API_KEY`	Google API key
`DASHSCOPE_API_KEY`	DashScope API key (é¿éäº)
`REPLICATE_API_TOKEN`	Replicate API token
`APIMART_API_KEY`	APIMart API key
`OPENAI_IMAGE_MODEL`	OpenAI model override
`GOOGLE_IMAGE_MODEL`	Google model override
`DASHSCOPE_IMAGE_MODEL`	DashScope model override (default: z-image-turbo)
`REPLICATE_IMAGE_MODEL`	Replicate model override (default: google/nano-banana-pro)
`APIMART_IMAGE_MODEL`	APIMart model override (default: gemini-3-pro-image-preview, e.g. doubao-seedream-5-0-lite)
`OPENAI_BASE_URL`	Custom OpenAI endpoint
`GOOGLE_BASE_URL`	Custom Google endpoint
`DASHSCOPE_BASE_URL`	Custom DashScope endpoint
`REPLICATE_BASE_URL`	Custom Replicate endpoint
`APIMART_BASE_URL`	Custom APIMart endpoint (default: https://api.apimart.ai)
`APIMART_TASK_LANGUAGE`	Task status language for polling (default: en)

Load Priority: CLI args > EXTEND.md > env vars > <cwd>/.baoyu-skills/.env > ~/.baoyu-skills/.env

Replicate Model Configuration

When using --provider replicate, the model can be configured in the following ways (highest priority first):

CLI flag: --model <owner/name>
EXTEND.md: default_model.replicate
Env var: REPLICATE_IMAGE_MODEL
Built-in default: google/nano-banana-pro

Supported model formats:

owner/name (recommended for official models), e.g. google/nano-banana-pro
owner/name:version (community models by version), e.g. stability-ai/sdxl:<version>

Examples:

# Use Replicate default model
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --provider replicate

# Override model explicitly
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --provider replicate --model google/nano-banana

Provider Selection

--ref provided + no --provider â auto-select Google first, then OpenAI, then Replicate, then APIMart
--provider specified â use it (if --ref, must be google, openai, replicate, or apimart)
Only one API key available â use that provider
Multiple available â default to Google

Quality Presets

Preset	Google imageSize	OpenAI Size	Use Case
`normal`	1K	1024px	Quick previews
`2k` (default)	2K	2048px	Covers, illustrations, infographics

Google/APIMart imageSize: Can be overridden with --imageSize 1K|2K|4K

Aspect Ratios

Supported: 1:1, 16:9, 9:16, 4:3, 3:4, 2.35:1

Google multimodal: uses imageConfig.aspectRatio
Google Imagen: uses aspectRatio parameter
OpenAI: maps to closest supported size

Generation Mode

Default: Sequential generation (one image at a time). This ensures stable output and easier debugging.

Parallel Generation: Only use when user explicitly requests parallel/concurrent generation.

Mode	When to Use
Sequential (default)	Normal usage, single images, small batches
Parallel	User explicitly requests, large batches (10+)

Parallel Settings (when requested):

Setting	Value
Recommended concurrency	4 subagents
Max concurrency	8 subagents
Use case	Large batch generation when user requests parallel

Agent Implementation (parallel mode only):

# Launch multiple generations in parallel using Task tool
# Each Task runs as background subagent with run_in_background=true
# Collect results via TaskOutput when all complete

Error Handling

Missing API key â error with setup instructions
Generation failure â auto-retry once
Invalid aspect ratio â warning, proceed with default
Reference images with unsupported provider/model â error with fix hint (switch to Google multimodal, OpenAI GPT Image edits, Replicate, or APIMart)

Extension Support

Custom configurations via EXTEND.md. See Preferences section for paths and supported options.

GitHub 仓库 ↗ ← 返回陌讯 Skills 聚合平台