agent-browser
4
总安装量
4
周安装量
#52111
全站排名
安装命令
npx skills add https://github.com/fmschulz/omics-skills --skill agent-browser
Agent 安装分布
gemini-cli
4
codex
4
cursor
4
trae
3
antigravity
3
codebuddy
3
Skill 文档
Agent Browser
Automate browser interactions through the agent-browser CLI for repeatable, scriptable web tasks.
Instructions
- Install and initialize the CLI.
- Open the target URL and capture a snapshot.
- Interact with elements using snapshot references.
- Re-snapshot after navigation or state changes.
- Export results (screenshots or JSON) for downstream use.
Quick Reference
| Task | Action |
|---|---|
| Install | npm install -g agent-browser then agent-browser install |
| Open page | agent-browser open <url> |
| Snapshot | agent-browser snapshot -i --json |
| Interact | click @eN, fill @eN "text" |
| Screenshot | agent-browser screenshot output.png |
| Docs | See references/quick-start.md |
Input Requirements
- Target URL(s)
- CLI installed and Chromium downloaded
- Credentials if login is required
Output
- Screenshots (PNG)
- JSON snapshots of page structure
- Extracted text/attributes
Quality Gates
- Snapshot captured after each major navigation step
- Interactions verified in a follow-up snapshot
- Outputs saved to disk with clear filenames
Examples
Example 1: Capture a page snapshot
agent-browser open https://example.org
agent-browser snapshot -i --json > page.json
Troubleshooting
Issue: Chromium not installed
Solution: Run agent-browser install (add --with-deps on Linux).
Issue: Element not found Solution: Re-snapshot and confirm the correct element reference.