rsn-learning-outcomes
npx skills add https://github.com/bellabe/lean-os --skill rsn-learning-outcomes
Agent 安装分布
Skill 文档
Learning
Systematic improvement from experience. Convert outcomes into better future performance.
Core Principle
Learning is not automatic. Experience without reflection is just repetition. Learning requires deliberate extraction of insight and updating of beliefs and behaviors.
Experience â Extract â Update â Apply â Better Outcomes
Mode Selection
| Mode | Question | Output | Trigger |
|---|---|---|---|
| Single-loop | Did action work? | Corrected action | Gap between expected/actual |
| Double-loop | Is frame right? | Updated frame | Pattern of single-loop failures |
| Reflection | What can we learn? | Transferable insights | Experience completed |
| Experimentation | Should we test this? | Validated/invalidated belief | Belief needs validation |
| Calibration | How accurate are we? | Adjusted confidence rules | Predictions need tuning |
Decision Tree
Is there a gap between expected and actual?
YES â Is this a pattern (3+ similar failures)?
YES â Double-loop (question the frame)
NO â Single-loop (fix the action)
NO â
Has an experience completed?
YES â Reflection (extract insights)
NO â
Do you have a belief that needs validation before commitment?
YES â Experimentation (test the belief)
NO â
Have predictions been consistently off?
YES â Calibration (adjust confidence)
NO â No learning mode needed
Mode Summaries
Single-Loop
Purpose: Correct action within existing frame.
Mental model: Thermostat â detect deviation, adjust action, return to target. The goal is not questioned.
Process: Gap detected â Diagnose cause â Identify correction â Verify fix â Prevent recurrence
Key rules:
- Fix the proximate cause
- Don’t question the goal (yet)
- Add prevention to avoid repeat
- Check: is this a pattern? If yes â double-loop
Output: Corrected action with prevention
Double-Loop
Purpose: Question and update the frame itself.
Mental model: Not just adjusting thermostat, but asking: “Is heating the right goal?”
Process: Pattern detected â Examine current frame â Challenge assumptions â Construct new frame â Validate change
Key rules:
- Requires 3+ single-loop failures (pattern)
- Articulate current frame (goals, assumptions, constraints)
- Challenge each element with evidence
- Test new frame before full commitment
Output: Updated frame with validation plan
Reflection
Purpose: Extract transferable insight from experience.
Mental model: Mine the experience for reusable gold.
Process: Capture experience â Analyze what worked/didn’t â Extract insights â Update beliefs â Create artifacts â Disseminate
Key rules:
- Reflection is scheduled, not accidental
- Analyze both successes and failures
- Specify conditions when insight applies
- Create persistent artifacts (heuristics, playbooks, checklists)
Output: Insights and artifacts for future use
Experimentation
Purpose: Test belief through deliberate action before commitment.
Mental model: Scientific method applied to operational decisions.
Process: Formulate hypothesis â Design experiment â Execute â Analyze results â Conclude â Act
Key rules:
- Hypothesis must be falsifiable
- Define success criteria before testing
- Control variables where possible
- Don’t peek at results early
Output: Validated or invalidated belief with next steps
â references/experimentation.md
Calibration
Purpose: Adjust prediction confidence based on track record.
Mental model: Weather forecaster â when I say 80% confident, it should be right 80% of the time.
Process: Assemble track record â Stratify by confidence level â Calculate calibration error â Identify patterns â Define adjustment rules
Key rules:
- Need 30+ predictions for meaningful calibration
- Stratify by domain (calibration varies)
- Adjust gradually, not dramatically
- Monitor ongoing calibration
Output: Calibration adjustment rules
Output Format
Every learning output includes:
## [Mode]: [Topic]
**Trigger:** [What triggered this learning mode]
**Analysis:**
[Mode-specific analysis]
**Conclusion:**
[What was learned/changed]
**Artifacts:**
- [Any persistent outputs: rules, checklists, playbooks]
**Next:**
- [Actions to take]
- [What to monitor]
Mode Transitions
| From | To | Trigger |
|---|---|---|
| Single-loop | Double-loop | Pattern detected (3+ similar failures) |
| Double-loop | Experimentation | New frame needs validation |
| Experimentation | Reflection | Experiment completed |
| Reflection | Calibration | Predictions were off |
| Any | Single-loop | New gap detected |
Learning â Other Skills Handoff
| Learning Output | Next Skill |
|---|---|
| Corrected action | Causal (execute) |
| New frame | Thinking (reason with new assumptions) |
| Insight about perception | Perceiving (adjust attention) |
| Validated hypothesis | Causal (plan rollout) |
| Calibration rule | All thinking modes (adjust confidence) |
Anti-Patterns
| Avoid | Do Instead |
|---|---|
| No reflection time | Schedule deliberate reflection |
| Blame focus | Focus on system/process |
| Premature double-loop | Require pattern of failures |
| Peeking at experiment results | Wait for full duration |
| Over-adjusting calibration | Gradual adjustments |
| Insight hoarding | Plan dissemination |
References
| File | Content |
|---|---|
| single-loop.md | Action correction within frame |
| double-loop.md | Frame examination and update |
| reflection.md | Insight extraction process |
| experimentation.md | Hypothesis testing methods |
| calibration.md | Confidence adjustment |