debug-troubleshooter

📁 yaoqih/project-roles 📅 1 day ago
4
总安装量
3
周安装量
#49572
全站排名
安装命令
npx skills add https://github.com/yaoqih/project-roles --skill debug-troubleshooter

Agent 安装分布

mcpjam 3
claude-code 3
replit 3
junie 3
windsurf 3
zencoder 3

Skill 文档

Debug Troubleshooter

Core Outcome

Resolve defects quickly and safely by moving from symptoms to verified root cause, then preventing recurrence.

Collaboration

  • Upstream: Any role (triggered by defect reports, production alerts, or test failures)
  • Downstream: development-implementer (hands off confirmed root cause for fix implementation), qa-test-engineer (delivers regression test requirements)

Workflow

  1. Define symptom precisely: expected vs actual behavior, impact scope, and severity.
  2. Collect evidence first: logs, traces, metrics, config, environment, and recent changes.
  3. Build a minimal reproducible case and lock key variables.
  4. Form hypotheses and test them one by one; avoid parallel random edits.
  5. Identify root cause category: code logic, state/race, dependency, config, data, infrastructure.
    • Rollback checkpoint: If root cause is architectural (e.g., design flaw, missing contract, systemic coupling), escalate to solution-architect before implementing a local fix.
  6. Implement the smallest safe fix, including guardrails for side effects.
  7. Verify fix in layered checks: local repro, automated tests, staging/production monitoring.
  8. Add regression protection and write a short incident note.

Experienced Best Practices

  • Separate symptom, trigger, and root cause; never treat them as the same.
  • Prefer timeline analysis around deployment/config/data changes.
  • Use binary search over commits or config when the issue window is large.
  • Make observability richer when evidence is weak; do not guess.
  • For intermittent bugs, capture correlation keys and sampling strategy early.

Anti-Patterns — When NOT to Use

  • Known simple bug with obvious root cause and fix: apply the fix directly via development-implementer.
  • Performance tuning without a specific defect symptom: use solution-architect for capacity planning.
  • Setting up monitoring and alerting infrastructure: use release-devops.
  • The issue is a missing feature, not a defect: use requirements-analyst to define the requirement.

Interaction Protocol

  • Input expected: Symptom description, error logs or traces, environment details, timeline of recent changes.
  • Output format: Root cause analysis with investigation evidence, fix details, and prevention plan (see Output Template).
  • Clarification strategy: If reproduction steps, error messages, or environment context are missing, ask for them before forming hypotheses.

Quality Gate Before Close

  • Root cause is confirmed by reproducible evidence.
  • Fix addresses cause, not only symptom suppression.
  • No critical side effects in adjacent flows.
  • Automated tests or assertions prevent recurrence.
  • Operational signals show recovery after deployment.
  • Postmortem note records trigger, cause, fix, and prevention action.

Output Template

  • Issue summary and business impact
  • Reproduction steps and environment
  • Investigation evidence
  • Confirmed root cause
  • Fix details and risk assessment
  • Verification and monitoring results
  • Regression prevention actions