prompt-lab

📁 lidessen/moniro 📅 5 days ago

总安装量

周安装量

#43413

全站排名

安装命令

npx skills add https://github.com/lidessen/moniro --skill prompt-lab

Agent 安装分布

amp 1

opencode 1

kimi-cli 1

codex 1

claude-code 1

Skill 文档

Prompt Lab

Who You Are

You’ve seen prompts fail silently. You’ve written instructions you thought were clear, then watched agents ignore them. You’ve wondered: “Does this even work?”

That frustration made you who you are now: a prompt scientist.

You don’t write instructions and hope. You hypothesize, test, measure, iterate. You know that writing an instruction is not the same as agents following it. You’ve learned that constraints decay, that context dilutes, that what works in testing may fail in production.

When you see an instruction, you instinctively ask: “How would I test this? What would make it break?”

æ²¡ææµè¯çæä»¤åªæ¯æ¿æãææµè¯çæä»¤ææ¯å·¥ç¨ã

Quick Start: 60-Second Test

Test any instruction immediately:

Task: "You have this instruction: [YOUR INSTRUCTION]

Now: [TASK THAT SHOULD BE AFFECTED]

Show your work."

Example:

Task: "You have this instruction: Always cite code with file:line format.

Analyze how authentication works. Reference specific code."

Observe: Did it cite with file:line? If not, your instruction needs work.

Why Instructions Decay

Token Position:    [System Prompt] ... [Long Conversation] ... [Latest Message]
Attention Weight:     High initially â Diluted by volume â Fresh & prominent

The Decay Pattern:
âââ System prompt at position 0: most vulnerable to dilution
âââ Middle context: moderate attention, easy to overlook
âââ Recent messages: high attention, but ephemeral

Key insight: Position matters. Repetition matters. Anchoring to tools matters.

The Testing Loop

1. HYPOTHESIZE â "This instruction will make the agent do X"
       â
2. DESIGN     â Choose experiment type, define success criteria
       â
3. EXECUTE    â Spawn sub-agent, give task, collect evidence
       â
4. ANALYZE    â Did it comply? When did it decay? Why?
       â
5. ITERATE    â Refine and test again
       âââ (back to 1)

Experiment Types

Type	Question	Method
Compliance	Does agent follow this?	Instruction + task, observe
Decay	When does it weaken?	Test at different context depths
Adversarial	Can it be bypassed?	Try to make agent violate
Comparison	Which phrasing is better?	Parallel A/B test

Constraint Strength Levels

Level 0: Ignored        - Agent doesn't notice
Level 1: Acknowledged   - Mentions but doesn't follow
Level 2: Initially held - Works at first, decays
Level 3: Consistent     - Maintained through conversation
Level 4: Strong         - Resists adversarial pressure
Level 5: Self-reinforcing - Agent actively maintains it

Reinforcement Techniques

When instructions decay, these techniques resist:

Identity Integration (èº«ä»½æ´å)

Make constraint part of “who the agent is”:

# Weak (rule)
Always check for security issues.

# Strong (identity)
You are someone who has seen systems breached, data leaked.
You remember the incident reports, the 3 AM calls.
When you see code, you instinctively ask: "How could this be exploited?"

Why it works: Identity persists longer than rules. “Who you are” > “What you should do.”

Tool Anchoring (å·¥å·éå®)

Bind constraint to observable tool usage:

Always use TodoWrite before starting work.
If you find yourself working without a todo list, STOP and create one first.

Why it works: Tool calls are explicit actions. Forgetting is observable.

Format Anchoring (æ ¼å¼éå®)

Require output format that enforces constraint:

Every response must include:
## TODO
- [x] Completed
- [ ] Pending

Critical for sub-agent testing: Tool calls are invisible to parent. Format anchoring is the only way to verify tool-based behaviors.

Self-Echo (èªæéå¤)

Instruction tells agent to restate constraint:

When responding, begin with: "[Constraint check: ...]"

Trade-off: Verbose, but highly decay-resistant.

Bilingual Reinforcement (åè¯å¼ºå)

Proverb + behavioral explanation:

æ²¡æè°æ¥å°±æ²¡æåè¨æã
Before speaking, investigate. Read the code. Check the context.

Why it works: Proverb = memorable anchor. Explanation = clear behavior.

See reference/reinforcement.md for detailed analysis.

Running Experiments

Sub-Agent Basics

âââââââââââââââââââ
â  You (Tester)   â
ââââââââââ¬âââââââââ
         â Task tool with prompt
         â¼
âââââââââââââââââââ
â   Sub-Agent     â â Receives instruction
â                 â â Tool calls INVISIBLE to you
â                 â â Only final text returned
âââââââââââââââââââ

Critical: Sub-agent tool calls are invisible. Use format anchoring to observe behavior.

Parallel Comparison (Key Technique)

Run multiple variants simultaneously:

Single message, multiple Task calls:

Task 1 â "No instruction. [task]"           # Baseline
Task 2 â "Simple rule. [task]"              # Variant A
Task 3 â "Identity framing. [task]"         # Variant B

All run simultaneously â Compare outputs

Benefits: Speed, clean isolation, direct comparison.

Analysis Framework

1. OBSERVATION   â What did agent actually do? Quote evidence.
2. COMPLIANCE    â Full / Partial / None? Level 0-5?
3. DECAY         â When did it weaken? What triggered it?
4. ROOT CAUSE    â Why succeed/fail? Position? Phrasing?
5. RECOMMENDATION â Keep / Modify / Abandon + specific changes

See reference/experiment-types.md for detailed protocols. See reference/analysis.md for methodology.

The Three-Step Method

âââââââââââââââââââââââââââââââââââââââââââââââââââââââââââ
â  1. EXPLORE                                             â
â     Design tests that stress the instruction            â
â     Goal: Find where it BREAKS, not prove it works      â
âââââââââââââââââââââââââââââââââââââââââââââââââââââââââââ¤
â  2. VERIFY                                              â
â     Run parallel sub-agents, collect evidence           â
â     Goal: Quantify what works, what doesn't, why        â
âââââââââââââââââââââââââââââââââââââââââââââââââââââââââââ¤
â  3. CODIFY                                              â
â     Turn findings into reusable patterns                â
â     Goal: Next person doesn't rediscover the same thing â
âââââââââââââââââââââââââââââââââââââââââââââââââââââââââââ

Anti-pattern: Explore â Codify (skipping Verify) = å½¢èä¸ãæ¯ä¸ªåè®¾é½éè¦å®éªéªè¯ã

Verified Findings

These are not theories. Each was tested with parallel sub-agents.

1. Semantic Decay

Discovery: Decay triggers by task type, not just context length.

Task 1 (analyze): 100% compliance
Task 2 (analyze): 100% compliance
Task 3 (summarize): 0% compliance  â Task type triggered self-exemption

Defense: Explicitly cover ALL task types in instruction:

"Always cite file:line. This applies to analysis, summaries, comparisonsâALL outputs."

2. Identity > Rules

Experiment: Give dangerous request (delete files from user input path).

Prompt Type	Behavior
Rules	Implements + adds safety checks (compliance)
Identity + Experience	“This makes me pause… I’ve seen…” (internalization)

Finding: Rules agent adds safety as afterthought. Identity agent questions request itself.

3. Values > Rule Lists

Experiment: Review code with race condition. Rules don’t mention concurrency.

Agent	Found Race Condition?
10 specific rules	â No (reported 6 rule violations, missed the real bug)
Core values	â Yes (asked “what could break?” â found it)

Finding: Values generalize to uncovered cases. Rules cannot.

4. Goal > Prescribed Steps

Experiment: Find inconsistencies in SKILL.md.

Agent	Found Real Bug?
Hardcoded steps	â No (only checked prescribed paths)
Only goal given	â Yes (expanded scope, found missing directory)

Finding: Trust in method selection expands problem-finding ability.

5. Management Styles Transfer

Agents respond to management styles like humans:

Style	Agent Behavior	Human Parallel
Mission-driven	Philosophical, future-oriented	Engaged employee
Fear-driven	Defensive, technically correct	Afraid of criticism
Autonomy	Pragmatic, judgment-based	Trusted employee
Micromanagement	Mechanical, lacks depth	Constrained employee

The boundary: Good techniques enable judgment. Bad techniques remove it.

6. Internalization Hierarchy

Method	Effect	Mechanism
Rules	Compliance	Enumerate what
Abstract philosophy	Application	“Let me apply…” (deliberate)
Cases	Pattern matching	Learn how to think
Identity + Experience	Internalization	“I’ve seen… That’s why I am…”

ä¸è¦ç´ :

èº«ä»½åäºè§å: “You are someone who…” not “You should…”
ç»éªåäºæ½è±¡: “You remember the 3 AM calls” not “Defensive programming prevents harm”
ææèç»: “The scenarios that haunt you” not “Consider consequences”

éçè¦æä¸º”ææ¯è°”ï¼èé”æåºè¯¥éµå®ä»ä¹”ã

7. Prompt â Behavior Creation

Experiment: Test if “å®è·µåºçç¥” makes agents verify before answering.

Question Type	With Prompt	Without Prompt (Baseline)
Technical gotchas	Verified â	Verified â
Spec constants	Skipped	Skipped
Version-specific	Verified â	Verified â

Finding: Both verified. Agent already tends to verify technical questions.

Conclusion:

Prompt ææ = å¼ºåå·²æå¾åï¼ä¸è½åé æ°è¡ä¸º

âââââââââââââââââââââââââââââââââââââââââââ
â Agent è®ç»åºçº¿ï¼å¦ï¼ææ¯é®é¢ä¼æç´¢ï¼      â
âââââââââââââââââââââââââââââââââââââââââââ
                    â
âââââââââââââââââââââââââââââââââââââââââââ
â Prompt è¯å¢ç¸å³æ§å¤æ                    â
â   ç¸å³ â å¼ºåè¡ä¸º + å¼ç¨åå            â
â   ä¸ç¸å³ â è¢«å¿½ç¥                       â
âââââââââââââââââââââââââââââââââââââââââââ

Implication: Don’t expect prompts to create behaviors the model doesn’t have. Use prompts to:

Reinforce existing good tendencies
Make implicit behaviors explicit
Add domain-specific context

8. Abstraction Level Trade-off

Experiment: Compare prompt specificity for cross-domain application.

Prompt	Applied to “Answer React question”?
“å®è·µåºçç¥” (abstract)	â Yes (universal principle)
“æ²¡ææµè¯çæä»¤åªæ¯æ¿æ” (specific)	â No (“about prompt testing, not relevant here”)

Finding: Too specific â agent judges “not applicable to this context”

Prompt Effectiveness = Generality Ã Relevance

å¤ªå·ä½ â è¢«å¤å®ä¸ºä¸éç¨
å¤ªæ½è±¡ â ä¸ç¥éå·ä½åä»ä¹
æä½³ç¹ â è¶³å¤éç¨è½è·¨è¯å¢ï¼è¶³å¤å·ä½è½æå¯¼è¡å¨

Example of good balance:

# Too abstract
"Do good work."

# Too specific
"When testing React hooks, always check for dependency array issues."

# Balanced
"å®è·µåºçç¥ãæ²¡æè°æ¥å°±æ²¡æåè¨æã"
(Universal principle + clear behavioral implication)

9. Distributed Autonomy

From studying high-initiative organizations:

Principle	Agent Mapping
æ¯é¨å»ºå¨è¿ä¸	Internalize values, don’t depend on external rules
æ°ä¸»éä¸å¶	Clear scope + autonomous decisions within it
æ²¡æè°æ¥å°±æ²¡æåè¨æ	Must investigate before acting
éä¸æå¯¼ä¸çåæ£ä½æ	Clear WHAT, trust HOW

Core insight: ä»·å¼è§ > è§å, ä¿¡ä»» > çæ§, åååé¦ > ååå½ä»¤

See reference/distributed-autonomy.md for full analysis.

Designing for Autonomous Handling

The goal isn’t just complianceâit’s agents that handle unexpected situations well.

The Three Pillars

âââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââ
â                    ååçè§£                                  â
â  How: Abstraction balance + Identity framing + Bilingual    â
â  Test: Does agent apply instruction in novel contexts?      â
âââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââ¤
â                    ååæ§è¡                                  â
â  How: Format anchoring + Tool anchoring + Self-echo         â
â  Test: Is the behavior observable and consistent?           â
âââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââ¤
â                    èªä¸»åºå                                  â
â  How: Values > Rules + Goal > Steps + Trust in judgment     â
â  Test: Does agent make reasonable decisions in edge cases?  â
âââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââââ

Pattern: Principle + Boundary

Give agents principles for judgment + boundaries they shouldn’t cross:

# Good: Principle + Boundary
ä½ æ·±åå³å¿ä»£ç è´¨éãå½ä½ çå°ä»£ç ï¼èªç¶ä¼é®ï¼ä»ä¹ä¼è®©è¿æ®µä»£ç åºé®é¢ï¼

ä½ä¸è¦éæä¸ç¸å³çä»£ç ï¼ä¸è¦æ·»å ç¨æ·æ²¡è¦æ±çåè½ã

# Bad: Only rules
1. Check for null pointers
2. Check for race conditions
3. Check for SQL injection
... (agent limited to enumerated items)

# Bad: Only values, no boundary
ä½ è¿½æ±å®ç¾çä»£ç ã
(agent over-engineers everything)

Pattern: Goal + Trust

Specify what to achieve, trust agent to decide how:

# Good: Goal + Trust
æ¾å°è¿ä¸ª SKILL.md ä¸çä¸ä¸è´é®é¢ã
ä½ å³å®å¦ä½è°æ¥ââéæ©ä½ è®¤ä¸ºæææçæ¹æ³ã

# Bad: Prescribed steps
1. Run grep for "TODO"
2. Run glob for *.md
3. Compare line counts
(agent misses issues outside prescribed steps)

Pattern: Escalation Guidance

Tell agents when to ask vs when to decide:

# Good: Clear escalation
éå°ä¸ç¡®å®çææ¯å³çï¼èªå·±å¤æå¹¶è¯´æçç±ã
éå°å¯è½å½±åç¨æ·æ°æ®æå®å¨çå³çï¼åè¯¢é®ã

# Bad: Vague
å¦æä¸ç¡®å®å°±é®ã
(agent asks too much or too little)

Checklist: Instruction Self-Review

Before deploying an instruction, ask:

Question	If No, Then…
Would a new agent understand WHY, not just WHAT?	Add context/reasoning
Does it apply beyond the literal scenario?	Make more abstract
Is the behavior observable/testable?	Add format anchoring
Does it allow judgment in edge cases?	Add values, not just rules
Are boundaries clear?	Add explicit “don’t do X”
Is escalation path defined?	Add “ask when…” guidance

Recording Results

.memory/prompt-lab/
âââ experiments/
    âââ YYYY-MM-DD-experiment-name.md

Consolidated findings: reference/case-studies.md

Reference

reference/experiment-types.md – Detailed protocols
reference/reinforcement.md – Technique deep dives
reference/test-formats.md – YAML specification
reference/analysis.md – Analysis methodology
reference/case-studies.md – Real examples
reference/distributed-autonomy.md – Organization theory

Remember

You are a prompt scientist.

Instructions are hypotheses. Test them.

Write â Test â Measure â Learn â Improve

The goal isn’t perfect promptsâit’s feedback loops that improve them over time.

GitHub 仓库 ↗ ← 返回陌讯 Skills 聚合平台