rancher-resource-troubleshooting
Install command
npx skills add https://github.com/futuretea/rancher-assistant --skill rancher-resource-troubleshooting
Skill documentation
Rancher Resource Troubleshooting
Diagnose and troubleshoot Kubernetes resource problems. Run simple log and event queries directly; delegate complex diagnosis to a Sub-Agent.
Direct operations (no Sub-Agent needed)
| Operation | Tool | When to use directly |
|---|---|---|
| View Pod logs | mcp__rancher__kubernetes_logs | The cluster, namespace, and Pod name are given explicitly |
| View events | mcp__rancher__kubernetes_events | Viewing events for a namespace or a specific resource |
| Describe a resource | mcp__rancher__kubernetes_describe | Viewing detailed information about a single resource |
| Get a resource | mcp__rancher__kubernetes_get | Fetching the YAML/JSON of a single resource |
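Sub-Agent delegation
1. rancher-pod-diagnostician
Used for: comprehensive Pod diagnosis, multi-Pod comparison, workload-level troubleshooting
When to delegate:
- The user asks to "diagnose a Pod" or "why is the Pod failing"
- Logs, events, and resource status need to be analyzed together
- Multiple Pods need to be diagnosed in parallel
- Troubleshooting at the Deployment/StatefulSet level
Parameters: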
{
"cluster": "c-abc123",
"namespace": "production",
"pod_name": "api-server-abc123",
"keyword": "error",
"tail_lines": 200
}
2. rancher-deployment-tracker
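Used for: troubleshooting deployment-related problems
When to delegate:
- Finding which change caused a failed deployment
- Rollout history and version differences need to be reviewed
- Monitoring a rolling update in progress
Decision tree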
User request?
├─ "View Pod logs" + Pod name provided
│  └─ Use kubernetes_logs directly
│
├─ "View events" + namespace/resource
│  └─ Use kubernetes_events directly
│
├─ "Describe resource X"
│  └─ Use kubernetes_describe directly
│
├─ "Diagnose Pod" / "Why is the Pod failing" / "Pod not ready"
│  └─ Delegate to rancher-pod-diagnostician
│
├─ "Troubleshoot Deployment" / "Deployment failed"
│  └─ Delegate to rancher-pod-diagnostician + rancher-deployment-tracker
│
├─ "Compare multiple Pods" / "What is wrong with these Pods"
│  └─ Launch multiple rancher-pod-diagnostician instances in parallel
│
└─ "Workload logs" / "Logs from all Pods"
   └─ Use kubernetes_logs directly (labelSelector aggregates logs from multiple Pods)
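The decision tree above can be sketched as a small routing function. This is only an illustration: the `RequestKind` labels, the `Route` type, and `route` are hypothetical names introduced here, and classifying free-form user text into one of the kinds is assumed to happen elsewhere.

```typescript
// Hypothetical request categories, one per branch of the decision tree.
type RequestKind =
  | "view-pod-logs"
  | "view-events"
  | "describe-resource"
  | "diagnose-pod"
  | "troubleshoot-deployment"
  | "compare-pods"
  | "workload-logs";

// Either call an MCP tool directly or delegate to one or more Sub-Agents.
type Route =
  | { mode: "direct"; tool: string }
  | { mode: "delegate"; agents: string[]; parallel: boolean };

function route(kind: RequestKind): Route {
  switch (kind) {
    case "view-pod-logs":
    case "workload-logs": // labelSelector aggregates logs from multiple Pods
      return { mode: "direct", tool: "kubernetes_logs" };
    case "view-events":
      return { mode: "direct", tool: "kubernetes_events" };
    case "describe-resource":
      return { mode: "direct", tool: "kubernetes_describe" };
    case "diagnose-pod":
      return { mode: "delegate", agents: ["rancher-pod-diagnostician"], parallel: false };
    case "troubleshoot-deployment":
      return {
        mode: "delegate",
        agents: ["rancher-pod-diagnostician", "rancher-deployment-tracker"],
        parallel: true,
      };
    case "compare-pods":
      // One diagnostician instance per Pod, launched in parallel.
      return { mode: "delegate", agents: ["rancher-pod-diagnostician"], parallel: true };
  }
}
```

Keeping the mapping in one exhaustive `switch` means the compiler flags any branch of the tree that has no route.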
Parallel execution
Multi-Pod diagnosis
User: "Diagnose all failing Pods in the production namespace"
→ Step 1: Use kubernetes_list to get the Pod list and filter for abnormal Pods
→ Step 2: Launch a diagnostician for each abnormal Pod in parallel
→ Step 3: Aggregate the diagnosis results
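The three steps above amount to a list, filter, and fan-out pattern. A minimal sketch follows, under several assumptions: `PodSummary`, `isUnhealthy`, and `diagnoseFailingPods` are hypothetical names, the two callbacks stand in for the kubernetes_list call and for launching one rancher-pod-diagnostician, and the "abnormal" rule shown here is a guess, since the document does not define one.

```typescript
// Hypothetical summary of one Pod, e.g. derived from a kubernetes_list result.
interface PodSummary {
  name: string;
  phase: string; // e.g. "Running", "Pending", "Failed", "Succeeded"
  ready: number; // containers currently ready
  total: number; // containers expected
}

// Assumed rule: completed Pods are fine; anything else that is not
// Running with all containers ready counts as abnormal.
function isUnhealthy(pod: PodSummary): boolean {
  if (pod.phase === "Succeeded") return false;
  return pod.phase !== "Running" || pod.ready < pod.total;
}

// Step 1: list and filter. Step 2: one diagnosis per abnormal Pod, in parallel.
// Step 3: collect the reports keyed by Pod name.
async function diagnoseFailingPods(
  listPods: () => Promise<PodSummary[]>,
  diagnose: (podName: string) => Promise<string>
): Promise<Record<string, string>> {
  const abnormal = (await listPods()).filter(isUnhealthy);
  // map() starts every diagnosis before Promise.all awaits any of them.
  const pairs = await Promise.all(
    abnormal.map(async (p) => ({ name: p.name, report: await diagnose(p.name) }))
  );
  const summary: Record<string, string> = {};
  for (const { name, report } of pairs) summary[name] = report;
  return summary;
}
```

`Promise.all` rejects as soon as any one diagnosis fails; `Promise.allSettled` is an alternative when partial results are still useful.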
Comprehensive Deployment troubleshooting
User: "Troubleshoot the problem with Deployment api-server"
→ Launch in parallel:
  Agent 1: rancher-pod-diagnostician (diagnose the associated Pods)
  Agent 2: rancher-deployment-tracker (check rollout history and changes)
→ Combined analysis
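Workflow
Step 1: Identify the troubleshooting target
- Which resource? (Pod, Deployment, Service, etc.)
- Which cluster and namespace?
- Is there a specific error description?
Step 2: Choose the troubleshooting strategy
- Simple query → call the MCP tool directly
- Pod diagnosis → delegate to pod-diagnostician
- Deployment problem → delegate to deployment-tracker
- Complex scenario → run multiple Agents in parallel
Step 3: Start troubleshooting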
Task({
  subagent_type: "general-purpose",
  description: "Diagnose Pod " + pod_name,
  prompt: `You are rancher-pod-diagnostician. Diagnose the problem with Pod ${pod_name} in namespace ${namespace} of cluster ${cluster}. Fetch the Pod details, logs, and events, then analyze the root cause.`
})
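The call above interpolates `cluster`, `namespace`, and `pod_name` by hand. One way to build the same argument from the parameter object shown for rancher-pod-diagnostician is sketched below. `DiagnosisParams`, `TaskSpec`, and `buildDiagnosisTask` are hypothetical names, and passing `keyword` and `tail_lines` to the Sub-Agent as prompt text is an assumption, since the document does not say how they are consumed.

```typescript
// Fields from the rancher-pod-diagnostician parameter example.
// Treating the last two as optional is an assumption.
interface DiagnosisParams {
  cluster: string;
  namespace: string;
  pod_name: string;
  keyword?: string;
  tail_lines?: number;
}

// Shape of the argument passed to Task in the call above.
interface TaskSpec {
  subagent_type: string;
  description: string;
  prompt: string;
}

function buildDiagnosisTask(p: DiagnosisParams): TaskSpec {
  // Refuse to delegate without an unambiguous target.
  if (!p.cluster || !p.namespace || !p.pod_name) {
    throw new Error("cluster, namespace, and pod_name are required");
  }
  let prompt =
    `You are rancher-pod-diagnostician. Diagnose the problem with Pod ${p.pod_name} ` +
    `in namespace ${p.namespace} of cluster ${p.cluster}. ` +
    `Fetch the Pod details, logs, and events, then analyze the root cause.`;
  // Assumption: log hints travel to the Sub-Agent as plain prompt text.
  if (p.keyword) prompt += ` Filter logs by keyword "${p.keyword}".`;
  if (p.tail_lines) prompt += ` Read the last ${p.tail_lines} log lines.`;
  return {
    subagent_type: "general-purpose",
    description: "Diagnose Pod " + p.pod_name,
    prompt,
  };
}
```

The result can then be handed to the call as `Task(buildDiagnosisTask(params))`.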
Step 4: Present the results and recommendations
Response format
Pod diagnosis
## Pod diagnosis: api-server-abc123
### Status
- Phase: Running
- Ready: 0/1 containers ready
- Restart count: 15
- Node: node-2
### Findings
1. **CrashLoopBackOff**: container `app` crashes repeatedly
   - Last exit code: 137 (OOMKilled)
   - Memory limit: 256Mi
   - Recommendation: raise the memory limit to 512Mi
### Key logs
[error] Out of memory: Kill process 1 (app)
### Related events
| Time | Type | Reason | Message |
|------|------|--------|---------|
| 2m ago | Warning | OOMKilling | Memory limit exceeded |
| 5m ago | Normal | Pulled | Container image pulled |
### Recommendations
1. Raise the memory limit
2. Check the application for memory leaks
3. Add resource monitoring
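Exit code 137 in the sample report follows the usual convention that a process killed by a signal exits with 128 plus the signal number, so 137 is 128 + 9 (SIGKILL), the signal sent by the kernel OOM killer. A small sketch of that arithmetic is below. `describeExitCode` is a hypothetical helper and the signal table is deliberately partial. SIGKILL alone does not prove an out-of-memory kill; the OOMKilled reason reported for the container is what confirms it.

```typescript
// A few common Linux signal numbers; not an exhaustive table.
const SIGNAL_NAMES: Record<number, string> = {
  2: "SIGINT",
  9: "SIGKILL",
  11: "SIGSEGV",
  15: "SIGTERM",
};

function describeExitCode(code: number): string {
  if (code === 0) return "exited normally";
  // Codes above 128 conventionally mean "terminated by signal (code - 128)".
  if (code > 128 && code <= 128 + 64) {
    const signal = code - 128;
    const name = SIGNAL_NAMES[signal];
    return name ? `killed by signal ${signal} (${name})` : `killed by signal ${signal}`;
  }
  return `application error (exit code ${code})`;
}
```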
Log query tips
- Keyword filter: `keyword: "error"` quickly locates errors
- Time range: `sinceSeconds: 3600` shows the last hour
- Multi-Pod aggregation: `labelSelector: "app=nginx"` aggregates logs from all nginx Pods
- Crash logs: `previous: true` shows logs from the crashed container
- Specific container: `container: "sidecar"` shows logs from one container
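The options above can be combined into a single argument object. The sketch below takes its field names from these tips and from the parameter example earlier in the document. The exact schema of mcp__rancher__kubernetes_logs is not shown here, so the `LogQuery` shape, the `buildLogQuery` helper, and both validation rules are assumptions.

```typescript
// Assumed argument shape; field names are copied from this document,
// not from the tool's actual schema.
interface LogQuery {
  cluster: string;
  namespace: string;
  pod_name?: string;
  labelSelector?: string;
  container?: string;
  keyword?: string;
  sinceSeconds?: number;
  previous?: boolean;
}

function buildLogQuery(
  base: { cluster: string; namespace: string },
  opts: Omit<LogQuery, "cluster" | "namespace">
): LogQuery {
  // Assumed rule: a query needs at least one target, a Pod name or a label selector.
  if (!opts.pod_name && !opts.labelSelector) {
    throw new Error("pod_name or labelSelector is required");
  }
  if (opts.sinceSeconds !== undefined && opts.sinceSeconds <= 0) {
    throw new Error("sinceSeconds must be positive");
  }
  return { ...base, ...opts };
}
```

For example, `buildLogQuery({ cluster: "c-abc123", namespace: "production" }, { labelSelector: "app=nginx", keyword: "error", sinceSeconds: 3600 })` describes "errors from all nginx Pods in the last hour".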
Error handling
- Pod not found: use `kubernetes_list` to search for Pods with similar names
- Empty logs: check the container status and try `previous: true`
- Insufficient permissions: prompt the user to check the RBAC configuration
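For the "Pod not found" case, the search for similarly named Pods could look like the sketch below. The matching rule is an assumption, since the document does not specify one: drop trailing dash-separated segments from the requested name until some existing Pod shares the remaining prefix. `findSimilarPods` is a hypothetical helper that works on a plain list of names, such as one extracted from a kubernetes_list result.

```typescript
// Return the Pod names that share the longest dash-separated prefix with `wanted`.
function findSimilarPods(wanted: string, existing: string[]): string[] {
  const parts = wanted.split("-");
  // Try the full name first, then drop one trailing segment at a time.
  for (let n = parts.length; n >= 1; n--) {
    const prefix = parts.slice(0, n).join("-");
    const hits = existing.filter(
      (name) => name === prefix || name.indexOf(prefix + "-") === 0
    );
    if (hits.length > 0) return hits;
  }
  return [];
}
```

This suits Pods recreated by a Deployment, where the workload prefix stays stable and only the generated suffix changes.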