cache-audit

📁 ussumant/cache-audit 📅 7 days ago

总安装量

周安装量

#70177

全站排名

安装命令

npx skills add https://github.com/ussumant/cache-audit --skill cache-audit

Agent 安装分布

amp 2

gemini-cli 2

github-copilot 2

codex 2

kimi-cli 2

cursor 2

Skill 文档

Prompt Cache Audit Skill

Trigger: /cache-audit or “audit my caching” or “check my cache setup” or “am I breaking the cache?”

What it does: Reads your live Claude Code configuration and checks it against the 6 prompt caching rules from Anthropic’s engineering team. Returns a scored report with specific, actionable fixes.

Reference: Based on Thariq Shihipar’s thread “Lessons from Building Claude Code: Prompt Caching Is Everything”

When Invoked

Run all checks automatically. Do not ask for confirmation. Read the relevant files and produce the full report in one pass.

The 6 Checks

Check 1 â Prompt Ordering (Static Before Dynamic)

Read: ~/.claude/settings.json, all active CLAUDE.md files in the project hierarchy

What to look for:

Does the system prompt load in the right order? Rule: static content first, dynamic content last
Correct order: System prompt â Tools â CLAUDE.md â Session context â Messages
Flag: Any dynamic content (timestamps, git status, current date, user stats) appearing in the system prompt itself
Flag: CLAUDE.md files that include session-specific or time-sensitive data
Pass: CLAUDE.md files that are purely static instructions, conventions, and file references

Scoring:

PASS: System prompt is fully static, dynamic data injected via messages
WARNING: Some dynamic data in system prompt but low-frequency change
FAIL: High-churn dynamic content (timestamps, file contents) in system prompt

Check 2 â Dynamic Updates via Messages (not System Prompt Edits)

Read: All hook files listed under SessionStart and UserPromptSubmit in ~/.claude/settings.json

What to look for:

Hooks should output dynamic data as additionalContext in their JSON response (which becomes a <system-reminder> message) â not by modifying the system prompt
Check each hook’s output format: does it use hookSpecificOutput.additionalContext? â
Flag: Any hook that writes to a system prompt file, modifies CLAUDE.md, or injects into the static prefix
Pass: Hooks that return JSON with additionalContext key

Also check:

Is currentDate injected via message (memory.md context) or hardcoded in system prompt?
Is git status coming from a hook â message, or somewhere static?

Scoring:

PASS: All hooks use additionalContext pattern
FAIL: Any hook modifies system prompt or CLAUDE.md mid-session

Check 3 â Tool Set Stability (No Add/Remove Mid-Session)

Read: ~/.claude/settings.json, ~/.claude/skills/*.md, MCP server configurations

What to look for:

Tools should be identical at every turn of the conversation
Check: Do any skills explicitly add new tools when invoked? (tool definitions that only appear after a skill runs)
Check: Are MCP tools using defer_loading: true stubs rather than full schemas loaded conditionally?
Flag: Any skill that modifies the available tool set
Pass: MCP tools present as lightweight stubs in every request, full schemas only loaded on demand via ToolSearch

Note: The ToolSearch tool itself is the correct pattern â it lets the model discover tools without adding/removing from the base set.

Scoring:

PASS: Tool set is fixed at session start, all MCP tools deferred
WARNING: Some conditional tool loading that may cause cache misses
FAIL: Skills or hooks that add/remove tools mid-conversation

Check 4 â No Mid-Session Model Switches

Read: ~/.claude/settings.json, ~/.claude/skills/*.md, any agent/team configurations

What to look for:

The model field in settings.json should be set and stable
Check: Do any skills switch models in the same conversation thread? (e.g., running a quick haiku query inline)
Pass: Model switches are done via subagents with handoff messages â the Explore/Plan agent types running on Haiku are separate conversations, not inline switches
Flag: Any pattern where the main conversation calls a different model mid-turn

Scoring:

PASS: Single model per conversation, subagents used for model delegation
FAIL: Inline model switching in same conversation thread

Check 5 â Dynamic Content Size

Read: Hook files, git status injection, session-reminder outputs

What to measure:

Estimate the size of dynamic content injected per session/turn
Check the git status hook output â measure the typical size of the injected git diff/status
Check streak/quota hook output size
Check if granola meeting sync injects large content per turn

Thresholds:

< 2k chars injected per turn: â PASS
2kâ10k chars: â ï¸ WARNING (still correct pattern, just expensive)
10k chars: â FLAG â consider trimming

Known issue to flag:

Git status with hundreds of untracked files (like the personalOS repo) can easily exceed 40k chars per session start. This is injected correctly via messages, but the raw token cost is high. Recommend: trim to branch name + changed file count + modified file list only.

Suggested fix for large git status:

# Instead of full git status, inject only:
git branch --show-current
git diff --stat HEAD | tail -5
git status --short | grep "^[^?]" | head -20  # only tracked changes, no untracked

Check 6 â Fork Safety (Compaction & Subagent Calls)

Read: Any compaction configuration, skill invocations that fork context

What to look for:

When Claude Code runs compaction (context window fills), does the summary request reuse the same system prompt + tools as the parent?
When skills fork a subagent, do they pass through the same prefix?
This is partially an Anthropic infrastructure concern â Claude Code handles it correctly by default

Scoring:

PASS: Using Claude Code’s built-in compaction (handled correctly)
MANUAL CHECK: Custom compaction or summarization flows â verify they use identical system prompt + tool definitions as parent

Output Format

After running all checks, output this report:

ââââââââââââââââââââââââââââââââââââ
  PROMPT CACHE AUDIT
ââââââââââââââââââââââââââââââââââââ

Score: X/6

â / â ï¸ / â  Rule 1 â Ordering: [PASS/WARNING/FAIL]
   â [Specific finding or "All good"]

â / â ï¸ / â  Rule 2 â Message injection: [PASS/WARNING/FAIL]
   â [Specific hooks checked and their pattern]

â / â ï¸ / â  Rule 3 â Tool stability: [PASS/WARNING/FAIL]
   â [MCP tool count, defer status]

â / â ï¸ / â  Rule 4 â Model switching: [PASS/WARNING/FAIL]
   â [Model in settings, any inline switches found]

â / â ï¸ / â  Rule 5 â Dynamic content size: [PASS/WARNING/FAIL]
   â [Estimated chars/turn for each injection point]

â / â ï¸ / â  Rule 6 â Fork safety: [PASS/MANUAL CHECK]
   â [Compaction pattern used]

ââââââââââââââââââââââââââââââââââââ
  TOP FIX
ââââââââââââââââââââââââââââââââââââ
[Single most impactful change with exact code/config to implement]

If everything passes: say so clearly and note the estimated cost savings from the current setup vs. a naive implementation.

Reference: The 6 Rules (Quick Cheatsheet)

Rule	Do	Don’t
1. Ordering	Static system prompt â CLAUDE.md â messages	Dynamic data in system prompt
2. Updates	Inject via `<system-reminder>` in messages	Edit system prompt mid-session
3. Tools	Fixed tool set + deferred stubs	Add/remove tools per turn
4. Models	One model per conversation, subagents for switches	Inline model switching
5. Size	Trim dynamic injections to minimum needed	Dump full git status (40k chars)
6. Forks	Same prefix for compaction/subagents	Different system prompt for summary calls

GitHub 仓库 ↗ ← 返回陌讯 Skills 聚合平台

cache-audit

Agent 安装分布

Skill 文档

Prompt Cache Audit Skill

When Invoked

The 6 Checks

Check 1 â Prompt Ordering (Static Before Dynamic)

Check 2 â Dynamic Updates via Messages (not System Prompt Edits)

Check 3 â Tool Set Stability (No Add/Remove Mid-Session)

Check 4 â No Mid-Session Model Switches

Check 5 â Dynamic Content Size

Check 6 â Fork Safety (Compaction & Subagent Calls)

Output Format

Reference: The 6 Rules (Quick Cheatsheet)

Check 1 â Prompt Ordering (Static Before Dynamic)

Check 2 â Dynamic Updates via Messages (not System Prompt Edits)

Check 3 â Tool Set Stability (No Add/Remove Mid-Session)

Check 4 â No Mid-Session Model Switches

Check 5 â Dynamic Content Size

Check 6 â Fork Safety (Compaction & Subagent Calls)