superbuild
npx skills add https://github.com/asteroid-belt-llc/skills --skill superbuild
Agent 安装分布
Skill 文档
Superbuild: Plan Execution Engine
Execute implementation plans one phase at a time with strict quality enforcement, test verification, and conventional commit generation.
Overview
Superbuild is a rigid execution engine for implementation plans. It enforces:
- Phase-by-phase execution (no skipping ahead)
- Definition of Done verification before phase completion
- Test presence and passing verification
- Linter/formatter/typechecker enforcement
- Conventional commit message generation per phase
This is NOT a planning skill. Use superplan to create plans, then superbuild to execute them.
Reference Index – MUST READ When Needed
References contain detailed templates and patterns. Read BEFORE you need them.
| When | Reference | What You Get |
|---|---|---|
| Step 3: Executing tasks | [superplan/references/TASK-MICROSTRUCTURE.md] | 5-step TDD format per task |
| Step 3: Executing tasks | [superplan/references/TDD-DISCIPLINE.md] | TDD rules, rationalizations, red flags |
| Step 4: Any check fails | references/ENFORCEMENT-GUIDE.md | Failure templates, output parsing, stack commands |
| Step 5: Updating plan | references/PLAN-UPDATES.md | Checkbox patterns, status updates |
| Step 6: Commit message | references/COMMIT-FORMAT.md | CLI-safe chars, HEREDOC format |
| Steps 7-8: Stop/Resume | references/EXECUTION-CONTROL.md | Output templates, compaction behavior |
| Step 9: AUTO-COMPACT | references/EXECUTION-CONTROL.md | CHECKPOINT parsing, auto-compact workflow |
DO NOT SKIP REFERENCES. They contain exact templates and patterns that are NOT duplicated here.
Critical Workflow
âââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââ
â SUPERBUILD EXECUTION FLOW â
â (REPEAT FOR EACH PHASE) â
âââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââ¤
â â
â 1. INGEST PLAN â User provides plan document path â
â â â NO PLAN = EXIT (ask user, then exit if none) â
â 2. READ PHASES â Output ALL phases with estimates â
â â â IF context high â suggest compact first â
â 2.5 PLAN REVIEW â Check for ambiguity, missing deps, test gaps â
â â â IF concerns â PAUSE and ask user â
â 3. EXECUTE PHASE â One phase at a time (or parallel if marked) â
â â â USE SUB-AGENTS for parallel phases â
â 4. ENFORCE DOD â Tests exist? Tests pass? Linter? Formatter? â
â â â ALL must pass â continue. ANY fail â STOP â
â 5. UPDATE PLAN â Check off tasks, update status in plan file â
â â â â ï¸ THIS HAPPENS AFTER EVERY PHASE â
â 6. COMMIT MSG â Generate conventional commit (NEVER git ops) â
â â â User handles all git operations â
â 7. FUNCTIONAL TESTâ Explain how to test. Offer integration script â
â â â NEVER auto-create scripts. ALWAYS ask first â
â 8. STOP â Full stop. Suggest compact. Wait for user. â
â â â OVERRIDE: --build-all flag continues â
â 9. AUTO-COMPACT â BUILD-ALL ONLY: Run /compact at CHECKPOINT â
â â Parse focus from CHECKPOINT, compact, continueâ
â â
â ââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââ â
â Steps 3-8 repeat for EACH PHASE. Plan updates after EVERY phase. â
â BUILD-ALL: Steps 3-9 repeat automatically with auto-compact. â
â ââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââ â
â â
âââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââ
Step 1: Ingest Plan
REQUIRED: Plan document must be provided.
Plan Header Recognition
Plans created by superplan include this header:
> **For Claude:** Execute this plan using `/superbuild` skill.
This confirms the plan is in the expected format with:
- Parallelizable phases with poker estimates
- Definition of Done per phase
- 5-step TDD micro-structure per task
I'll help you execute your implementation plan.
Please provide the plan document:
1. Path to plan file (e.g., docs/feature-plan.md)
2. Paste the plan content directly
Which would you prefer?
If no plan provided after asking: EXIT immediately.
I cannot execute without a plan document.
To create a plan, use the `superplan` skill first:
/superplan
Then come back with the completed plan.
[EXIT - No further action]
NO EXCEPTIONS. Do not improvise. Do not create plans on the fly. Do not proceed without a plan document.
Step 2: Read All Phases
After ingesting the plan:
- Output all phases with their estimates and dependencies
- Check context usage – if high, suggest compacting first
PLAN LOADED: [Feature Name]
ââââââââââââââââââââââââââââââââââââââââââââââââ
| Phase | Name | Est. | Depends On | Parallel With | Status |
|-------|------|------|------------|---------------|--------|
| 0 | Bootstrap | 5 | - | - | ⬠|
| 1 | Setup | 3 | 0 | - | ⬠|
| 2A | Backend | 8 | 1 | 2B, 2C | ⬠|
| 2B | Frontend | 5 | 1 | 2A, 2C | ⬠|
| 2C | Tests | 3 | 1 | 2A, 2B | ⬠|
| 3 | Integration | 5 | 2A,2B,2C | - | ⬠|
Total: 29 points | Parallel phases: 2A, 2B, 2C
â ï¸ Context Usage Advisory
If context is high, consider compacting before continuing.
Large plans consume significant context per phase.
Ready to execute Phase 0?
Step 2.5: Critical Plan Review
Before executing, review the plan for concerns:
- Ambiguous tasks – Any task unclear about what to do?
- Missing dependencies – Are required files/APIs/packages identified?
- Test gaps – Does each task have testable acceptance criteria?
- Risky changes – Any destructive operations or breaking changes?
If Concerns Exist
â ï¸ PLAN REVIEW - Concerns Identified
ââââââââââââââââââââââââââââââââââââ
1. [Concern 1 - what and why]
2. [Concern 2 - what and why]
Please clarify before I proceed.
[EXECUTION PAUSED]
If No Concerns
Proceed to Step 3 (Execute Phase).
Step 3: Execute Phase
TDD Micro-Structure Enforcement
Plans created by superplan use a 5-step TDD micro-structure per task:
- Write failing test
- Run test, verify failure (MANDATORY – must see it fail)
- Write minimal implementation
- Run test, verify pass (MANDATORY – must see it pass)
- Stage for commit
STOP. Read
superplan/references/TASK-MICROSTRUCTURE.mdandsuperplan/references/TDD-DISCIPLINE.mdNOW before executing any task.
TDD Enforcement Rules
When executing tasks, enforce this order STRICTLY:
| Rule | Enforcement |
|---|---|
| Test before code | STOP if implementation exists without test |
| Verify RED | STOP if test passes immediately (testing existing behavior) |
| Minimal code | STOP if adding features beyond current test |
| Verify GREEN | STOP if test not run after implementation |
| No regressions | STOP if other tests now fail |
If test passes immediately: The test is wrong. It tests existing behavior, not new behavior. Fix the test or you are not doing TDD.
If you cannot explain why the test failed: You do not understand what you are testing. Stop and clarify requirements.
Sequential Phases
Execute one at a time. Do not proceed to next phase until current is COMPLETE.
Parallel Phases
For phases marked “Parallel With”, MUST use sub-agents or parallel Task tool calls.
EXECUTING PARALLEL PHASES: 2A, 2B, 2C
âââââââââââââââââââââââââââââââââââââ
Launching 3 parallel sub-agents...
[Sub-agent 2A: Backend implementation]
[Sub-agent 2B: Frontend implementation]
[Sub-agent 2C: Test implementation]
Each sub-agent MUST return:
- Implementation status
- Definition of Done checklist status
- Conventional commit message
CRITICAL: Each sub-agent returns its commit message. Main agent MUST bubble up ALL commit messages to user.
Sub-Agent Result Verification
CRITICAL: Do NOT trust sub-agent claims without independent verification.
When parallel phases complete:
- Collect all sub-agent results
- For EACH sub-agent claiming success:
- Run
npm test(or equivalent) to verify tests pass - Run
npm run lintto verify linter passes - Verify plan file was actually updated
- Run
- Only after independent verification, output commit messages
Sub-agent reports are CLAIMS, not EVIDENCE. Fresh verification required.
Step 4: Enforce Definition of Done
EVERY phase must pass ALL quality gates before completion.
Fresh Verification Requirement
NO COMPLETION CLAIMS WITHOUT FRESH VERIFICATION EVIDENCE.
The Verification Gate (run for EVERY check):
- IDENTIFY – Which command proves this claim?
- RUN – Execute the command NOW (not from memory)
- READ – Examine FULL output, count failures/errors
- VERIFY – Does output match the claim?
- THEN CLAIM – Only state success after steps 1-4
Invalid Evidence:
- Previous run output (even from 2 minutes ago)
- Assumptions based on code changes
- Partial verification (“linter passed so types probably pass”)
- Agent/sub-agent reports without independent verification
Red Flag Language – HALT Immediately
If you find yourself using these words about verification:
| Red Flag | Meaning | Required Action |
|---|---|---|
| “should pass” | Uncertainty | RUN the check, don’t assume |
| “probably works” | Uncertainty | RUN the check, don’t assume |
| “seems to be fine” | Uncertainty | RUN the check, don’t assume |
| “I believe it passes” | No evidence | RUN the check, get evidence |
| “based on my changes” | Inference, not verification | RUN the check |
Any hedging language = missing verification. Stop and run the actual check.
Quality Gate Checklist
DEFINITION OF DONE - Phase [X]
ââââââââââââââââââââââââââââââ
[ ] Tests exist for new code
[ ] All tests pass (new AND existing)
[ ] Linter passes ([detected linter])
[ ] Formatter passes ([detected formatter])
[ ] Type checker passes ([detected checker])
[ ] No new warnings introduced
[ ] Plan document updated (checkboxes, status) â BEFORE commit message
Enforcement Rules
| Check | If PASS | If FAIL |
|---|---|---|
| Tests exist | Continue | STOP – Point out missing tests |
| Tests pass | Continue | STOP – Ask user to fix |
| Linter | Continue | STOP – Ask user to fix |
| Formatter | Continue | STOP – Ask user to fix |
| Type checker | Continue | STOP – Ask user to fix |
| Plan updated | Generate commit | STOP – Update plan first |
STOP means STOP. Do not proceed. Do not offer to fix automatically. Ask user to fix and re-run.
Failure Output Format
When any check fails, output:
â DEFINITION OF DONE FAILED
âââââââââââââââââââââââââââ
Issue: [Missing tests | Tests failing | Linter errors | etc.]
[Details of what failed]
Please fix, then tell me to continue.
[EXECUTION HALTED]
STOP. Read ENFORCEMENT-GUIDE.md NOW when any check fails – contains failure templates and stack-specific commands.
Step 5: Update Plan Document (EVERY PHASE)
â ï¸ MANDATORY: This step executes after EVERY phase, not just at the end.
After ALL quality gates pass, BEFORE generating commit message, UPDATE THE PLAN FILE:
- Check off completed tasks (
- [ ]â- [x]) - Update phase status in overview table (
â¬ââ) - Mark DoD items complete (
- [ ]â- [x])
PLAN DOCUMENT UPDATED - Phase [X]
âââââââââââââââââââââââââââââââââ
File: [plan-file-path]
Updates applied:
- Tasks: X/X items checked [x]
- DoD: X/X items checked [x]
- Status: ⬠â â
The plan now reflects Phase [X] completion.
Read PLAN-UPDATES.md for checkbox patterns and error handling.
WHY EVERY PHASE:
- Plan survives context compaction (conversation may not)
- Progress visible to anyone reading the plan
- Enables clean handoff between sessions
- Creates audit trail in git history
DO NOT SKIP THIS STEP. If you find yourself generating a commit message without updating the plan first, STOP and update the plan.
Step 6: Generate Conventional Commit
After Definition of Done passes, generate commit message.
CRITICAL: OUTPUT ONLY. NEVER run git commands.
| Type | When |
|---|---|
feat |
New feature |
fix |
Bug fix |
refactor |
Code restructure (no behavior change) |
test |
Adding/updating tests |
docs |
Documentation |
chore |
Build, config, dependencies |
Read COMMIT-FORMAT.md for CLI-safe characters and HEREDOC patterns.
Read EXECUTION-CONTROL.md for parallel phase commit output format.
Step 7: Functional Testing Instructions
After commit message, explain how to functionally test the phase.
Provide step-by-step manual verification instructions. Offer integration test script only if applicable – ALWAYS ask, NEVER auto-create.
Read EXECUTION-CONTROL.md for output format and script offer template.
Step 8: Stop Execution
FULL STOP after each phase (unless –build-all override).
- Output phase completion summary
- Show progress table
- Suggest context compaction
- Wait for explicit “Continue to Phase X” instruction
Post-compaction behavior: Complete in-progress phase only, then STOP. Todo list is NOT authorization to continue.
Build-all override: Only if user explicitly requests. Warn about risks first.
Read EXECUTION-CONTROL.md for output templates and compaction behavior.
Step 9: Auto-Compact in BUILD-ALL Mode
CRITICAL: In BUILD-ALL mode, automatic context compaction is MANDATORY.
When running with --build-all flag, execute /compact automatically after each phase CHECKPOINT to preserve context for remaining phases.
Auto-Compact Trigger
Plans created by superplan include CHECKPOINT markers at the end of each phase:
- [ ] **CHECKPOINT: Run `/compact focus on: Phase N complete, [key artifacts], Phase N+1 goals`**
BUILD-ALL Auto-Compact Workflow
âââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââ
â BUILD-ALL MODE: AUTO-COMPACT â
âââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââ¤
â â
â 1. COMPLETE PHASE â Execute all tasks, pass quality gates â
â â â â
â 2. UPDATE PLAN â Check off tasks, mark CHECKPOINT done â
â â â â
â 3. GENERATE COMMIT â Output conventional commit message â
â â â â
â 4. DETECT CHECKPOINTâ Find CHECKPOINT task in completed phase â
â â â â
â 5. EXTRACT FOCUS â Parse focus directive from CHECKPOINT â
â â â â
â 6. RUN /COMPACT â Execute: /compact focus on: [extracted] â
â â â â ï¸ NO USER INTERVENTION REQUIRED â
â 7. CONTINUE â Proceed to next phase automatically â
â â
â ââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââ â
â Steps 1-7 repeat for EACH PHASE until plan complete. â
â ââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââ â
â â
âââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââ
CHECKPOINT Parsing
Extract the focus directive from the CHECKPOINT task:
Input: - [ ] **CHECKPOINT: Run `/compact focus on: Phase 1 complete, auth models created, Phase 2 needs endpoints`**
Parsed: focus on: Phase 1 complete, auth models created, Phase 2 needs endpoints
Auto-Compact Execution
Execute /compact with the parsed focus directive:
AUTO-COMPACT TRIGGERED - Phase [X] Complete
âââââââââââââââââââââââââââââââââââââââââââ
Running: /compact focus on: [parsed focus directive]
Preserving:
- Phase [X] completion status
- Key artifacts: [list from focus]
- Phase [X+1] goals: [from focus]
[compact executes]
Context refreshed. Continuing to Phase [X+1]...
No CHECKPOINT Found
If a phase lacks a CHECKPOINT (legacy plan or manual creation):
â ï¸ NO CHECKPOINT FOUND - Phase [X]
âââââââââââââââââââââââââââââââââââ
This phase has no CHECKPOINT marker for auto-compact.
Options:
1. Continue without compact (risk context exhaustion)
2. Pause for manual /compact
3. Add CHECKPOINT to plan and retry
Recommendation: Option 2 for safety
[EXECUTION PAUSED - Awaiting instruction]
Read EXECUTION-CONTROL.md for detailed auto-compact patterns.
Rationalizations to Reject
| Excuse | Reality |
|---|---|
| “Let me just do the next phase too” | NO. Stop after each phase. |
| “The tests are mostly there” | NO. Tests must exist for ALL new code. |
| “It’s just a small linting error” | NO. All quality gates must pass. |
| “I’ll commit later” | NO. Generate commit message NOW. |
| “This phase doesn’t need tests” | NO. Every phase with code needs tests. |
| “Let me skip to the important part” | NO. Execute phases in dependency order. |
| “I can fix the formatter later” | NO. Formatter must pass before completion. |
| “The user wants to move fast” | NO. Quality enforcement is non-negotiable. |
| “I’m confident this works” | NO. Confidence â evidence. Run the check. |
| “I just ran it a moment ago” | NO. Run it again. Fresh evidence required. |
| “The code is correct, so tests will pass” | NO. Run the tests. Code review â testing. |
| “This is a minor change” | NO. All changes require verification. |
| “Too simple to need a test” | NO. Simple code breaks. Write the test. |
| “I’ll add tests after the code works” | NO. Tests-after pass immediately. No proof of value. |
| “I already manually tested it” | NO. Ad-hoc testing is not systematic. TDD required. |
| “The test passed immediately, so it works” | NO. Immediate pass = testing existing behavior. Fix test. |
Red Flags – STOP Immediately
If you catch yourself thinking any of these, STOP:
- “This is taking too long, let me skip ahead”
- “The user seems impatient, let me batch phases”
- “Tests can come after the feature works”
- “Linting is just style, not critical”
- “I’ll generate all commit messages at the end”
- “The plan doesn’t explicitly require tests”
- “The sub-agent confirmed it passes”
- “I already verified this earlier”
- “This is obviously correct”
- “I wrote the code first, but I’ll test it now”
- “The test passed on the first try”
- “This feature is too simple for TDD”
- “Let me just get it working, then add tests”
All of these = violation of superbuild protocol.
Quality Commands by Stack
Read ENFORCEMENT-GUIDE.md for stack-specific commands (JS/TS, Python, Go, Rust).
Summary: The Iron Rules
- No plan = No execution – Exit if plan not provided
- One phase at a time – Unless parallel phases (use sub-agents)
- All quality gates must pass – No exceptions
- Update plan after EVERY phase – Check off tasks, update status
- Generate commit message – Never run git commands
- Explain functional testing – Ask before writing scripts
- Full stop after phase – Unless –build-all override
- Suggest compact – Context management is critical
- Fresh verification required – Run checks NOW, not from memory
- Auto-compact in BUILD-ALL – Parse CHECKPOINT, run /compact, continue automatically
Superbuild is rigid by design. The enforcement protects code quality. Do not rationalize around it.