agent-browser-skill
Install command
npx skills add https://github.com/mgdaaslab/wharttest --skill agent-browser-skill
Skill documentation
Agent Browser: browser automation
A browser automation tool built on the Vercel agent-browser CLI and designed for AI agents.
⚠️ Core workflow: Snapshot + Ref
Never guess selectors. Always take a snapshot first, then act on elements through their refs:
# 1. Open the page
agent-browser open http://example.com
# 2. Take a snapshot of the interactive elements
agent-browser snapshot -i
# Example output:
# - heading "Example Domain" [ref=e1] [level=1]
# - button "Submit" [ref=e2]
# - textbox "Email" [ref=e3]
# - link "Learn more" [ref=e4]
# 3. Act on elements by ref
agent-browser click @e2
agent-browser fill @e3 "test@example.com"
agent-browser get text @e1
# 4. Take a new snapshot after the page changes
agent-browser snapshot -i
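The snapshot-then-ref rule can be scripted by looking a ref up in the snapshot text instead of hard-coding it. The following is a minimal sketch, assuming the `snapshot -i` output keeps the `- role "name" [ref=eN]` line shape shown in the example output; `ref_for` is a hypothetical helper, not part of agent-browser.

```shell
# ref_for ROLE NAME: read snapshot text on stdin and print the first matching ref as @eN.
# Assumes lines shaped like:  - button "Submit" [ref=e2]
# Exits non-zero when no line matches, so callers can stop instead of guessing.
ref_for() {
  awk -v role="$1" -v name="$2" '
    {
      prefix = "- " role " \"" name "\" [ref="
      start = index($0, prefix)
      if (start == 0) next
      rest = substr($0, start + length(prefix))
      stop = index(rest, "]")
      if (stop == 0) next
      print "@" substr(rest, 1, stop - 1)
      found = 1
      exit
    }
    END { exit (found ? 0 : 1) }
  '
}

# Possible use (not run here):
#   snapshot=$(agent-browser snapshot -i)
#   agent-browser click "$(printf '%s\n' "$snapshot" | ref_for button Submit)"
```

Because refs can change whenever the page changes, a lookup like this would need to run against a fresh snapshot each time.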
Usage
Call agent-browser commands directly from the shell. Each command is a standalone invocation that automatically connects to a browser instance managed by a background daemon.
Installation requirements
# Install globally
npm install -g agent-browser
# Download Chromium
agent-browser install
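A script that depends on the CLI can check for it up front and fail with a useful hint. This is a small sketch; `require_cmd` is a hypothetical helper name, and the hint text simply repeats the two install commands above.

```shell
# require_cmd NAME HINT: succeed if NAME is on PATH, otherwise print HINT to stderr and fail.
require_cmd() {
  if command -v "$1" >/dev/null 2>&1; then
    return 0
  fi
  echo "missing command: $1 ($2)" >&2
  return 1
}

# Possible use at the top of a script (not run here):
#   require_cmd agent-browser "npm install -g agent-browser && agent-browser install" || exit 1
```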
Core commands
Navigation
agent-browser open <url> # Open a page
agent-browser back # Go back
agent-browser forward # Go forward
agent-browser reload # Reload
agent-browser close # Close the browser
Snapshots (page analysis)
agent-browser snapshot # Full accessibility tree
agent-browser snapshot -i # Interactive elements only (recommended)
agent-browser snapshot -c # Compact output
agent-browser snapshot -d 3 # Limit depth to 3 levels
agent-browser snapshot -s "#main" # Scope to a CSS selector
agent-browser snapshot --json # JSON output (for programmatic use)
Interaction (using @ref)
agent-browser click @e1 # Click
agent-browser dblclick @e1 # Double-click
agent-browser focus @e1 # Focus the element
agent-browser fill @e2 "text" # Clear, then enter text
agent-browser type @e2 "text" # Append text (without clearing)
agent-browser press Enter # Press a key
agent-browser press Control+a # Press a key combination
agent-browser hover @e1 # Hover
agent-browser check @e1 # Check a checkbox
agent-browser uncheck @e1 # Uncheck a checkbox
agent-browser select @e1 "value" # Select a dropdown option
agent-browser scroll down 500 # Scroll down 500px
agent-browser scrollintoview @e1 # Scroll the element into view
agent-browser drag @e1 @e2 # Drag and drop
agent-browser upload @e1 file.pdf # Upload a file
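When a form has several fields, the `fill` calls can be grouped behind one helper. The sketch below uses a hypothetical `fill_form` function and a hypothetical `AB` variable for the command name; it assumes the CLI exits non-zero when a `fill` fails, which the text above does not state.

```shell
# AB is the command to run. AB=echo gives a dry run that only prints the arguments.
AB=${AB:-agent-browser}

# fill_form REF=VALUE...: call `fill` once per pair, stopping at the first failure.
# Assumes the CLI exits non-zero when a fill fails.
fill_form() {
  for pair in "$@"; do
    case $pair in
      *=*) ;;
      *) echo "fill_form: expected REF=VALUE, got: $pair" >&2; return 2 ;;
    esac
    ref=${pair%%=*}     # text before the first "="
    value=${pair#*=}    # text after the first "="
    "$AB" fill "$ref" "$value" || return 1
  done
}

# Possible use (not run here):
#   fill_form @e1=admin @e2=admin123456
```

Splitting on the first `=` only means values may themselves contain `=`.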
Getting information
agent-browser get text @e1 # Get the element's text
agent-browser get html @e1 # Get innerHTML
agent-browser get value @e1 # Get an input's value
agent-browser get attr @e1 href # Get an attribute
agent-browser get title # Get the page title
agent-browser get url # Get the current URL
agent-browser get count ".item" # Count matching elements
agent-browser get box @e1 # Get the element's bounding box
State checks
agent-browser is visible @e1 # Check whether the element is visible
agent-browser is enabled @e1 # Check whether the element is enabled
agent-browser is checked @e1 # Check whether the element is checked
Screenshots & PDF
agent-browser screenshot # Screenshot to standard output (base64)
agent-browser screenshot ./page.png # Save to a file
agent-browser screenshot --full # Full-page screenshot
agent-browser pdf output.pdf # Save as PDF
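The base64 form is convenient for piping. As a sketch, assuming standard output carries only the base64 payload (the listing above does not spell out the exact format), it could be decoded into a file like this; `b64_to_file` is a hypothetical helper.

```shell
# b64_to_file PATH: decode base64 text from stdin into PATH.
# Uses `base64 -d`; some BSD/macOS versions spell the decode flag differently.
b64_to_file() {
  base64 -d > "$1"
}

# Possible use (not run here):
#   agent-browser screenshot | b64_to_file ./page.png
```

For plain file output, passing a path to `screenshot` directly is simpler.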
Waiting
agent-browser wait @e1 # Wait for the element to become visible
agent-browser wait 2000 # Wait 2000 milliseconds
agent-browser wait --text "成功" # Wait for text to appear ("成功" means "success")
agent-browser wait --url "**/dashboard" # Wait for the URL to match
agent-browser wait --load networkidle # Wait for the network to go idle
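If a wait condition is flaky, the command can be wrapped in a bounded retry. This is a generic shell sketch, not an agent-browser feature; `retry` is a hypothetical helper, and it assumes `wait` exits non-zero when its condition is not met.

```shell
# retry ATTEMPTS DELAY CMD...: run CMD until it succeeds or has failed ATTEMPTS times.
# DELAY is the pause between attempts, passed to sleep (seconds).
retry() {
  _attempts=$1
  _delay=$2
  shift 2
  _n=1
  while :; do
    "$@" && return 0
    [ "$_n" -ge "$_attempts" ] && return 1
    _n=$((_n + 1))
    sleep "$_delay"
  done
}

# Possible use (not run here):
#   retry 3 1 agent-browser wait --url "**/dashboard"
```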
CSS selectors (also supported)
agent-browser click "#submit"
agent-browser fill "#email" "test@example.com"
agent-browser find role button click --name "Submit"
Session management
Multiple AI agents can each use their own browser instance:
# Separate sessions
agent-browser --session agent1 open site-a.com
agent-browser --session agent2 open site-b.com
# Or via an environment variable
AGENT_BROWSER_SESSION=agent1 agent-browser click @e1
# List active sessions
agent-browser session list
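To avoid repeating the session name on every command, the environment variable can be scoped to a single call. A minimal sketch; `with_session` is a hypothetical helper, and only `AGENT_BROWSER_SESSION` comes from the commands above.

```shell
# with_session NAME CMD...: run CMD with AGENT_BROWSER_SESSION set to NAME.
# The subshell keeps the variable from leaking into the calling shell.
with_session() (
  AGENT_BROWSER_SESSION=$1
  export AGENT_BROWSER_SESSION
  shift
  "$@"
)

# Possible use (not run here):
#   with_session agent1 agent-browser open site-a.com
#   with_session agent2 agent-browser open site-b.com
```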
Screenshot path convention
Save all screenshots to the directory named by the SCREENSHOT_DIR environment variable:
SCREENSHOT_DIR=$(pwd)/media/screenshots
agent-browser screenshot "${SCREENSHOT_DIR}/case_11_step1.png"
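The path convention can be wrapped in a helper so every step names its file the same way. This sketch assumes the `case_<id>_step<n>.png` pattern generalizes from the single example above; `shot_path` is a hypothetical helper.

```shell
# shot_path CASE_ID STEP: print the screenshot path for one step of a test case.
# Falls back to ./media/screenshots when SCREENSHOT_DIR is unset, and creates the directory.
shot_path() {
  _dir=${SCREENSHOT_DIR:-$(pwd)/media/screenshots}
  mkdir -p "$_dir" || return 1
  printf '%s/case_%s_step%s.png\n' "$_dir" "$1" "$2"
}

# Possible use (not run here):
#   agent-browser screenshot "$(shot_path 11 1)"
```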
Typical use case
Login test
# Open the login page
agent-browser open http://192.168.150.114:8913/login
# Take a snapshot of the page elements
agent-browser snapshot -i
# Output (the labels are the page's own text: username, password, log in):
# - textbox "请输入用户名" [ref=e1]
# - textbox "请输入密码" [ref=e2]
# - button "登录" [ref=e3]
# Fill in the form
agent-browser fill @e1 "admin"
agent-browser fill @e2 "admin123456"
# Take a screenshot
agent-browser screenshot ./step1_filled.png
# Click the login button
agent-browser click @e3
# Wait for the redirect
agent-browser wait --url "**/dashboard"
agent-browser snapshot -i
# Screenshot the result
agent-browser screenshot ./step2_result.png
# Close the browser
agent-browser close
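The login steps can also be packaged as a function that stops at the first failing command and still closes the browser. This is a sketch under stated assumptions: `login_check` and the `AB` variable are hypothetical, the refs `@e1` to `@e3` are the ones from the snapshot above and will differ on other pages, and the CLI is assumed to exit non-zero on failure.

```shell
# AB is the command to run. AB=echo gives a dry run that prints each step.
AB=${AB:-agent-browser}

# login_check BASE_URL USER PASSWORD: run the login steps in order.
# Returns 0 only if every step succeeded; always attempts to close the browser.
login_check() {
  if "$AB" open "$1/login" &&
     "$AB" snapshot -i &&
     "$AB" fill @e1 "$2" &&
     "$AB" fill @e2 "$3" &&
     "$AB" click @e3 &&
     "$AB" wait --url "**/dashboard" &&
     "$AB" snapshot -i
  then
    _status=0
  else
    _status=1
  fi
  "$AB" close || true   # clean up even after a failed step
  return "$_status"
}

# Possible use (not run here):
#   login_check http://192.168.150.114:8913 admin admin123456
```

Running it once with `AB=echo` prints the planned commands without touching a browser, which is a cheap way to review the sequence.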
Comparison with playwright-skill
| Feature | agent-browser-skill | playwright-skill |
|---|---|---|
| Invocation | Standalone CLI commands | node run.js "code" |
| Element targeting | Snapshot + @ref | CSS selectors |
| State persistence | Automatic background daemon | New browser launched each time |
| AI friendliness | High (designed for AI) | Medium |
| Code complexity | Simple commands | Requires writing JS code |