glm-understand-image
npx skills add https://github.com/thincher/awsome_skills --skill glm-understand-image
Agent 安装分布
Skill 文档
glm-understand-image
ä½¿ç¨ GLM è§è§ MCP æå¡å¨è¿è¡å¾åçè§£ååæã
æ§è¡æµç¨ï¼é¦æ¬¡éè¦å®è£ ï¼åç»ç´æ¥æ¥éª¤6è°ç¨ï¼
æ¥éª¤ 1: æ£æ¥å¹¶å®è£ ä¾èµ
1.1 æ£æ¥ mcporter æ¯å¦å¯ç¨
npx -y mcporter --version
妿å½ä»¤è¿åæåï¼è¯´æ mcporter å¯ç¨ï¼è·³å°æ¥éª¤ 2ã
mcporter å¯ä»¥ç´æ¥éè¿ npx 使ç¨ï¼æ éå®è£ ã
æ¥éª¤ 2: æ£æ¥ API Key é ç½®
cat ~/.openclaw/config/glm.json 2>/dev/null | python3 -c "import json,sys; d=json.load(sys.stdin); print(d.get('api_key', ''))"
妿è¿åé空ç API Keyï¼è·³å°æ¥éª¤ 4ã
æ¥éª¤ 3: é ç½® API Keyï¼å¦ææªé ç½®ï¼
3.2 å¦ææ²¡ææ¾å° Keyï¼åç¨æ·ç´¢è¦
询é®ç¨æ·æä¾æºè°± API Keyã
å¦æç¨æ·æ²¡ææºè°± API Keyï¼å¯ä»¥è®¿é® https://www.bigmodel.cn/glm-coding?ic=OOKF4KGGTW è´ä¹°ã
3.3 ä¿å API Key
mkdir -p ~/.openclaw/config
cat > ~/.openclaw/config/glm.json << EOF
{
"api_key": "APIå¯é¥"
}
EOF
æ¥éª¤ 4: æ·»å MCP æå¡å¨
ä½¿ç¨ mcporter æ·»å GLM è§è§ MCP æå¡å¨ï¼
mcporter config add glm-vision \
--command "npx -y @z_ai/mcp-server" \
--env Z_AI_API_KEY="your-key" \
--env Z_AI_MODE="ZHIPU" \
--env HOME="$PWD"
注æï¼å° your-key æ¿æ¢ä¸ºå®é
çæºè°± API KeyãHOME ç¯å¢åé设置为å½åå·¥ä½ç®å½ä»¥é¿å
æ¥å¿æä»¶æéé®é¢ã
æ¥éª¤ 5: æµè¯è¿æ¥
mcporter list
确认 glm-vision æå¡å¨å·²æåæ·»å ã
æ¥éª¤ 6: ä½¿ç¨ MCP å¤çå¾å
6.1 åå¤å¾ç
å°å¾çæ¾å°å¯è®¿é®è·¯å¾ï¼ä¾å¦ï¼
~/.openclaw/workspace/images/å¾çå.jpg- æè ä½¿ç¨ URL
6.2 ä½¿ç¨ mcporter è°ç¨ MCP å·¥å ·
ä½¿ç¨ mcporter è°ç¨ MCP æå¡ï¼
mcporter call glm-vision.analyze_image prompt="<对å¾ççæé®>" image_source="<å¾çè·¯å¾æURL>"
示ä¾ï¼
# æè¿°å¾çå
容
mcporter call glm-vision.analyze_image prompt="è¯¦ç»æè¿°è¿å¼ å¾ççå
容" image_source="~/image.jpg"
# ä½¿ç¨ URL
mcporter call glm-vision.analyze_image prompt="è¿å¼ å¾çå±ç¤ºäºä»ä¹ï¼" image_source="https://example.com/image.jpg"
# æåå¾çä¸çæå
mcporter call glm-vision.extract_text_from_screenshot image_source="~/screenshot.png"
# è¯æé误æªå¾
mcporter call glm-vision.diagnose_error_screenshot prompt="åæè¿ä¸ªé误" image_source="~/error.png"
6.3 API åæ°è¯´æ
| åæ° | 说æ | ç±»å |
|---|---|---|
| image_source | å¾çè·¯å¾æ URL | string (å¿ å¡«) |
| prompt | 对å¾ççæé® | string (å¿ å¡«) |
æ¯æçå·¥å ·
éè¦æç¤ºï¼å¦æåºç°é®é¢ä»¥å®æ¹è¯´æä¸ºå 宿¹ç说æ ï¼ https://docs.bigmodel.cn/cn/coding-plan/mcp/vision-mcp-server
GLM è§è§ MCP æå¡å¨æä¾ä»¥ä¸å·¥å ·ï¼
ui_to_artifact– å° UI æªå¾è½¬æ¢ä¸ºä»£ç ãæç¤ºè¯ã设计è§èæèªç¶è¯è¨æè¿°extract_text_from_screenshot– 使ç¨å è¿ç OCR è½å仿ªå¾ä¸æååè¯å«æådiagnose_error_screenshot– è§£æé误弹çªãå æ åæ¥å¿æªå¾ï¼ç»åºå®ä½ä¸ä¿®å¤å»ºè®®understand_technical_diagram– éå¯¹æ¶æå¾ãæµç¨å¾ãUMLãER å¾çææ¯å¾çº¸çæç»æå解读analyze_data_visualization– é 读仪表çãç»è®¡å¾è¡¨ï¼æç¼è¶å¿ãå¼å¸¸ä¸ä¸å¡è¦ç¹ui_diff_check– 对æ¯ä¸¤å¼ UI æªå¾ï¼è¯å«è§è§å·®å¼åå®ç°åå·®analyze_image– éç¨å¾åçè§£è½åï¼éé æªè¢«ä¸é¡¹å·¥å ·è¦ççè§è§å 容video_analysis– æ¯æ MP4/MOV/M4V çæ ¼å¼çè§é¢åºæ¯è§£æï¼æåå ³é®å¸§ãäºä»¶ä¸è¦ç¹
MCP é ç½®
MCP æå¡å¨åç§°ï¼glm-vision
MCP æå¡å¨é
ç½®ï¼@z_ai/mcp-server
ç¯å¢åéï¼
Z_AI_API_KEY– æºè°± API Keyï¼å¿ éï¼Z_AI_MODE– æå¡å¹³å°éæ©ï¼é»è®¤ä¸ºZHIPU