steel-browser

📁 steel-dev/cli 📅 2 days ago

Site-wide rank: #33284

Install command

npx skills add https://github.com/steel-dev/cli --skill steel-browser

Agent install distribution

codex 9

gemini-cli 8

github-copilot 8

kimi-cli 8

cursor 8

amp 8

Skill documentation

Steel Browser Skill

Use this skill to produce reliable, executable steel browser workflows for agent-driven automation.

Why this skill exists

steel browser keeps broad command compatibility with agent-browser while adding Steel-native session lifecycle and endpoint handling. Agents get better results when they use named sessions, deterministic mode selection, and task-specific command patterns instead of one-off commands.

Use this workflow

Identify the operating mode first.
Start or attach a named session before interactive work.
Run task commands in small verifiable steps.
Validate results (wait, snapshot, get ...) before moving on.
Stop sessions when the task is done unless the user asks to keep them alive.

Mode selection rules

Use cloud mode by default.
Use self-hosted mode if the user specifies --local, --api-url, or self-hosted infrastructure.
Keep mode consistent for all commands in a sequence.

Read references/steel-browser-lifecycle.md for full lifecycle and endpoint precedence details.
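The mode selection rules above can be sketched as a small shell helper. This is a minimal illustration, not part of the Steel CLI: the function name select_mode is hypothetical, and it only inspects the flags named above (--local and --api-url). A user who mentions self-hosted infrastructure in prose still has to be handled by the agent's own judgment.

```shell
# Hypothetical helper: decide the operating mode from user-supplied arguments.
# Prints "self-hosted" if --local or --api-url is present, otherwise "cloud".
select_mode() {
  for arg in "$@"; do
    case "$arg" in
      # The --api-url=VALUE spelling is an assumption; the source only names --api-url.
      --local|--api-url|--api-url=*)
        echo "self-hosted"
        return 0
        ;;
    esac
  done
  # No self-hosted indicator found: cloud is the default.
  echo "cloud"
}
```

Calling it once at the top of a workflow, for example MODE="$(select_mode "$@")", and reusing the result for every later command is one way to keep the mode consistent across a sequence.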

Session discipline

Prefer --session <name> across all commands in one workflow.
Parse and preserve the session ID from the output of steel browser start when downstream tooling needs it.
Treat connect_url as display-safe metadata; do not treat it as a raw secret-bearing URL.
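One way to keep --session consistent is to route every command through a single wrapper. The sketch below assumes a hypothetical function name, sb, and a SESSION variable; only the steel browser subcommands and the --session flag come from this document.

```shell
# Pick one session name for the whole workflow (reuse an existing value if set).
SESSION="${SESSION:-task-$(date +%s)}"

# Hypothetical wrapper: run any `steel browser` subcommand against $SESSION.
# The --session flag is appended after the subcommand's own arguments.
sb() {
  steel browser "$@" --session "$SESSION"
}
```

With this in place, a workflow reads as sb start, sb open https://example.com, sb snapshot -i, sb stop, and no command can drift to a different session by accident.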

Task execution pattern

Use this skeleton and adapt commands to the task:

SESSION="task-$(date +%s)"   # unique session name for this workflow
URL="https://example.com"    # placeholder: replace with the target URL
steel browser start --session "$SESSION"
steel browser open "$URL" --session "$SESSION"
steel browser snapshot -i --session "$SESSION"
# perform interactions/get/snapshot commands
steel browser stop --session "$SESSION"

For command families and examples, read references/steel-browser-commands.md.
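The skeleton above leaves the session running if a step fails partway through. A minimal sketch of one way to guarantee cleanup follows; the function name run_in_session and its two positional parameters are hypothetical, while the steel browser commands are the ones used in the skeleton.

```shell
# Hypothetical wrapper: start a named session, run the task steps,
# and always stop the session, even if a step fails.
run_in_session() {
  session="$1"
  url="$2"

  # If the session cannot be started there is nothing to clean up.
  steel browser start --session "$session" || return 1

  # Stop at the first failing step, but remember its exit status.
  status=0
  steel browser open "$url" --session "$session" &&
    steel browser snapshot -i --session "$session" ||
    status=$?

  # Cleanup runs regardless of how the steps above ended.
  steel browser stop --session "$session"
  return "$status"
}
```

A call such as run_in_session "task-$(date +%s)" https://example.com returns the status of the first failed step, so callers can still detect the failure after the session has been stopped. Skip the wrapper when the user asks to keep the session alive.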

Migration behavior

When users provide agent-browser commands or scripts:

Convert command prefix from agent-browser to steel browser.
Preserve original behavior intent.
Add Steel lifecycle commands (start, stop, sessions, live) when explicit session control is needed.

Read references/migration-agent-browser.md.
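The prefix conversion step can be sketched with sed. This is an illustration under stated assumptions: the function name migrate_prefix is hypothetical, and it only rewrites lines whose first word is agent-browser. It does not add lifecycle commands; those still need to be inserted by hand where explicit session control is required.

```shell
# Hypothetical helper: read a script on stdin and rewrite the command prefix
# on lines that begin (after optional indentation) with `agent-browser`.
# Lines that merely mention agent-browser elsewhere are left unchanged.
migrate_prefix() {
  sed -E 's/^([[:space:]]*)agent-browser([[:space:]]|$)/\1steel browser\2/'
}
```

For example, piping a line such as agent-browser open https://example.com through migrate_prefix yields steel browser open https://example.com, with the arguments untouched so the original behavior intent is preserved.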

Troubleshooting behavior

On auth, local runtime, stale sessions, CAPTCHA blocks, or attach errors:

Diagnose with sessions, live, and mode/endpoint checks.
For CAPTCHA, follow the mode-aware guidance in the troubleshooting reference:
Auto mode: wait and monitor screenshots.
Manual mode: run steel browser captcha solve.
Off mode: restart with CAPTCHA solving enabled and return to the blocked page.
Remind users that CAPTCHA solving and strong proxy evasion require a paid Steel plan.
Provide a minimal corrective command sequence.
Retry the original action sequence.

Read references/troubleshooting.md.
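The CAPTCHA guidance above depends on the mode (auto, manual, or off), and that dispatch can be sketched as a case statement. The function name captcha_next_step is hypothetical, and how the current CAPTCHA mode is discovered is left to the troubleshooting reference; the helper only maps a known mode to the next step described in this section.

```shell
# Hypothetical helper: map a CAPTCHA mode to the recommended next step.
captcha_next_step() {
  case "$1" in
    auto)   echo "wait and monitor screenshots" ;;
    manual) echo "run: steel browser captcha solve" ;;
    off)    echo "restart with CAPTCHA solving enabled, then return to the blocked page" ;;
    *)
      # Unknown mode: report it on stderr and signal failure.
      echo "unknown CAPTCHA mode: $1" >&2
      return 1
      ;;
  esac
}
```

Printing the step instead of executing it keeps the helper safe to call during diagnosis; the agent can then decide whether to run the command or relay it to the user.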

Response format

When giving users commands, prefer this structure:

Mode: cloud/self-hosted and why.
Session: chosen session name and lifecycle steps.
Commands: exact executable sequence.
Checks: what output confirms success.

If the user asks for terse output, keep the same order but shorten prose.
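As a rough illustration of the ordering above, a response can be assembled from four pieces in a fixed sequence. The function name format_response and its four positional parameters are hypothetical; the field labels are the ones listed in this section.

```shell
# Hypothetical helper: print a response in Mode / Session / Commands / Checks order.
# $1 = mode and reason, $2 = session note, $3 = command sequence, $4 = success checks
format_response() {
  printf 'Mode: %s\nSession: %s\nCommands:\n%s\nChecks: %s\n' "$1" "$2" "$3" "$4"
}
```

For terse output, the same call works with shorter strings; the order of the four fields stays the same.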

Guardrails

Do not print or request raw API keys in command output.
Do not mix cloud and local mode in one flow unless explicitly transitioning.
Do not assume an existing active session without checking.
When unsure how a command inherited from agent-browser behaves, run steel browser <command> --help.
There is no top-level steel browser extract command; use steel browser get ..., steel browser snapshot, and steel browser find ... instead.
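The guardrail against assuming an active session can be sketched as a pre-flight check. This relies on an assumption that is not confirmed by this document: that steel browser sessions prints session names as plain text. The function name session_listed is hypothetical, and the match is a simple substring search that may need tightening against the real output format.

```shell
# Hypothetical check: succeed only if the given name appears in the session list.
# ASSUMPTION: `steel browser sessions` prints session names as plain text.
# grep -F does a fixed-string substring match, so "demo" also matches "demo-2".
session_listed() {
  steel browser sessions | grep -qF -- "$1"
}
```

A workflow could then run session_listed "$SESSION" || steel browser start --session "$SESSION" before any interactive command, instead of assuming the session is already there.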

Reference routing table

Lifecycle, endpoint precedence, attach rules: references/steel-browser-lifecycle.md
Complete command families and examples: references/steel-browser-commands.md
Migration from upstream command usage: references/migration-agent-browser.md
Error handling and recovery playbooks: references/troubleshooting.md
