token-optimizer

📁 alexismunoz1/token-optimizer 📅 5 days ago

Total installs

Weekly installs

#62183

Site-wide rank

Install command

npx skills add https://github.com/alexismunoz1/token-optimizer --skill token-optimizer

Installs by agent

opencode 3

gemini-cli 3

antigravity 3

claude-code 3

github-copilot 3

codex 3

Skill documentation

Token Optimizer

Practical guide to reduce token consumption, lower costs, and get faster responses from Claude Code. Every recommendation is backed by real experiment data (see references/metrics-report.md).

Quick Wins Checklist

Apply these in order of impact:

Split large files (>150 lines) into focused modules: saves 18%+ tokens on focused tasks
Optimize your CLAUDE.md: can reduce consumption by 50-70%
Use /clear between tasks: eliminates irrelevant context
Use /compact in long conversations: compresses history
Use the right model: Haiku costs up to 18x less than Opus for simple tasks
Limit active MCPs to ≤10: fewer tools = less overhead per call
Use subagents for verbose tasks: output stays in the subagent's context

File Organization Rules

Small, focused files are the single highest-impact optimization. Our experiment showed that modular code reduced tokens by 18.2% and noise by 92% on focused tasks (the majority of daily work).

Core Rules

Maximum 150 lines per file: if longer, split by responsibility
Single responsibility: one concern per file
Descriptive names in kebab-case: the filename should tell the AI exactly what’s inside

Naming Convention

Avoid	Prefer
`utils.ts`	`string-utils.ts`, `date-utils.ts`
`helpers.ts`	`date-formatting-helpers.ts`
`api.ts` (all endpoints)	`users-api.ts`, `products-api.ts`
`index.ts` (with logic)	`user-authentication.ts`
`data.ts`	`product-catalog-data.ts`
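The "Avoid" column above can be turned into a quick check. The following is a minimal sketch, not part of the skill itself: the function names and the `GENERIC_STEMS` set are illustrative assumptions, and `index` is flagged only as a prompt to review whether the file contains logic.

```python
from pathlib import PurePath

# Generic stems taken from the "Avoid" column above (assumption: extend as needed).
GENERIC_STEMS = {"utils", "helpers", "api", "data", "index"}


def is_generic_name(path: str) -> bool:
    """Return True when the file's stem is one of the generic names."""
    return PurePath(path).stem.lower() in GENERIC_STEMS


def flag_generic(paths: list[str]) -> list[str]:
    """Keep only the paths whose filename says nothing about the contents."""
    return [p for p in paths if is_generic_name(p)]
```

For example, `flag_generic(["src/utils.ts", "src/string-utils.ts"])` keeps only `src/utils.ts`, because `string-utils` already names its concern.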

Why It Works

When the AI needs to fix an email validation bug:

Monolithic (814 lines): reads entire file → 49,466 tokens
Modular (67 lines): reads only validation-utils.ts → 40,447 tokens → 18.2% savings, 92% less noise
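The two percentages follow directly from the figures quoted above. As a small worked check (the helper name `percent_reduction` is an illustrative assumption):

```python
def percent_reduction(before: float, after: float) -> float:
    """Percentage by which `after` is smaller than `before`, to one decimal."""
    return round((before - after) / before * 100, 1)


# Token counts and line counts quoted in the comparison above.
token_savings = percent_reduction(49_466, 40_447)  # 18.2
line_reduction = percent_reduction(814, 67)        # 91.8, quoted as 92%
```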

At scale (5,000+ lines), monolithic files become impossible to process efficiently. Modular files maintain constant size regardless of project growth.

For detailed tables and project structure templates, see references/file-organization-guide.md

Action Items

Identify files over 150 lines in your project
Split them by responsibility (one concern = one file)
Rename generic files (utils.ts, helpers.ts) to descriptive names
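The first action item can be automated with a short script. This is a sketch under stated assumptions: the set of source suffixes and the skipped directories are guesses about a typical TypeScript project, and `find_long_files` is a hypothetical helper, not something the skill ships.

```python
from pathlib import Path

MAX_LINES = 150  # limit from the Core Rules above
SOURCE_SUFFIXES = {".ts", ".tsx", ".js", ".jsx"}       # assumption: adjust per project
SKIP_DIRS = {"node_modules", ".git", "dist", "build"}  # assumption: adjust per project


def count_lines(path: Path) -> int:
    """Count lines without loading the whole file into memory."""
    with path.open(encoding="utf-8", errors="replace") as handle:
        return sum(1 for _ in handle)


def find_long_files(root: str, max_lines: int = MAX_LINES) -> list[tuple[str, int]]:
    """Return (path, line_count) for source files over the limit, longest first."""
    base = Path(root)
    results = []
    for path in base.rglob("*"):
        if not path.is_file() or path.suffix not in SOURCE_SUFFIXES:
            continue
        # Only look at the path relative to the root when skipping directories.
        if SKIP_DIRS.intersection(path.relative_to(base).parts):
            continue
        lines = count_lines(path)
        if lines > max_lines:
            results.append((str(path), lines))
    return sorted(results, key=lambda item: item[1], reverse=True)
```

Calling `find_long_files(".")` from the project root lists the split candidates, longest first.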

Optimized CLAUDE.md Template

A well-structured CLAUDE.md can reduce token consumption by 50-70%. Keep it under 500 lines: essentials only.

# Project Name

Brief one-line description of the project.

## Tech Stack

- Language: TypeScript
- Framework: Next.js 14 (App Router)
- Database: PostgreSQL with Prisma ORM
- Testing: Vitest

## Project Structure

src/           → application code
tests/         → unit tests
docs/          → documentation
components/    → UI components
services/      → business logic
utils/         → specific utilities (string-utils, date-utils, etc.)

## Key Conventions

- Files: kebab-case, max 150 lines, single responsibility
- Tests: colocated in __tests__/ folders
- Imports: absolute paths from @/

## Commands

- `npm run dev`: start dev server
- `npm test`: run tests
- `npm run lint`: lint code

## Important Patterns

- All API routes in src/app/api/
- Shared types in src/types/
- Business logic in src/services/, never in components

Key Principles

Be specific: “PostgreSQL with Prisma” not just “database”
Include structure: tell the AI where things live
List commands: save the AI from guessing
Specify what to ignore: add directories the AI should skip
Use triggers, not full docs: reference skills/files for details, don’t inline everything

Action Items

Create or audit your CLAUDE.md
Ensure it’s under 500 lines
Include project structure, commands, and key conventions
Remove any verbose documentation (move to references or skills)
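One way to run this audit is to count lines per `##` section and see which sections dominate the file. The sketch below is illustrative only: `audit_claude_md` is a hypothetical helper, and it naively treats any line starting with `## ` as a heading, even inside a fenced block.

```python
MAX_CLAUDE_MD_LINES = 500  # limit recommended above


def audit_claude_md(text: str, max_lines: int = MAX_CLAUDE_MD_LINES) -> dict:
    """Report total length and the three largest '## ' sections of a CLAUDE.md."""
    lines = text.splitlines()
    sections: dict[str, int] = {}
    current = "(preamble)"
    for line in lines:
        if line.startswith("## "):
            current = line[3:].strip()
            sections.setdefault(current, 0)
        else:
            sections[current] = sections.get(current, 0) + 1
    largest = sorted(sections.items(), key=lambda kv: kv[1], reverse=True)
    return {
        "total_lines": len(lines),
        "within_limit": len(lines) < max_lines,  # "under 500 lines"
        "largest_sections": largest[:3],
    }
```

The largest sections are the first candidates to move into referenced files or skills.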

Context Management

Token waste often comes from accumulated irrelevant context, not from individual operations.

Essential Commands

Command	When to Use	Effect
`/clear`	Switching tasks	Resets context completely
`/compact`	Long conversation (>50 exchanges)	Compresses history, keeps essentials
`/context`	Diagnosing high token use	Shows what’s consuming tokens

Rules of Thumb

If you’ve corrected Claude more than 2 times on the same topic → /clear and restart with a better prompt
After completing a feature → /clear before starting the next one
Conversation feeling slow → try /compact first, /clear if that doesn’t help
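The rules of thumb, together with the >50-exchange threshold from the table, can be written as a small decision function. This is only a sketch of the guidance: the function and its parameters are hypothetical, and Claude Code does not expose such counters to scripts.

```python
from typing import Optional

MAX_CORRECTIONS = 2      # "more than 2 times on the same topic"
LONG_CONVERSATION = 50   # ">50 exchanges" from the table above


def suggest_command(corrections_on_topic: int, exchanges: int,
                    switching_task: bool) -> Optional[str]:
    """Map the rules of thumb to a slash command, or None if no action is needed."""
    if switching_task:
        return "/clear"    # new task or finished feature: reset context
    if corrections_on_topic > MAX_CORRECTIONS:
        return "/clear"    # restart with a better prompt
    if exchanges > LONG_CONVERSATION:
        return "/compact"  # compress history first
    return None
```

The checks are ordered so that `/clear` wins whenever both commands would apply, since a full reset makes compaction unnecessary.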

Lazy Loading Principle

Don’t front-load all information. Use triggers in CLAUDE.md that reference detailed docs:

## Authentication
For auth implementation details, see docs/auth-guide.md

One project achieved a 54% reduction in initial tokens (7,584 → 3,434) by:

Putting specific instructions in skills that load on demand
Keeping only triggers in CLAUDE.md, not complete documentation
Principle: Claude doesn’t need all info upfront; it needs to know when to load it

For advanced context strategies, see references/context-management-guide.md

Action Items

Start using /clear between unrelated tasks
Use /compact when conversations get long
Move detailed docs out of CLAUDE.md into referenced files

Advanced Optimizations

Subagents for Verbose Tasks

Use the Task tool for operations that generate large output (test runs, builds, searches). The verbose output stays in the subagent’s context; only the summary returns to your main conversation.

Best candidates for subagents:

Running test suites
Broad codebase searches
Build/compilation tasks
Log analysis

MCP Management

Keep maximum 10 active MCPs at a time
Keep maximum 80 total tools across all MCPs
Disable MCPs not needed for the current task
Each unused MCP still costs tokens in tool descriptions
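The two limits above are easy to check once you have a tool count per server. In this sketch the input mapping (server name to number of exposed tools) is an assumption: you would fill it in by hand from whatever your MCP servers report, and `check_mcp_budget` is a hypothetical helper.

```python
MAX_ACTIVE_MCPS = 10  # limit from the list above
MAX_TOTAL_TOOLS = 80  # limit from the list above


def check_mcp_budget(tools_per_mcp: dict[str, int]) -> list[str]:
    """Return a warning for each limit exceeded by the active MCP servers."""
    warnings = []
    if len(tools_per_mcp) > MAX_ACTIVE_MCPS:
        warnings.append(
            f"{len(tools_per_mcp)} active MCPs (limit {MAX_ACTIVE_MCPS})"
        )
    total_tools = sum(tools_per_mcp.values())
    if total_tools > MAX_TOTAL_TOOLS:
        warnings.append(f"{total_tools} total tools (limit {MAX_TOTAL_TOOLS})")
    return warnings
```

An empty list means the current setup is within both limits.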

Strategic Model Selection

Task Type	Model	Why
80% of daily tasks	Sonnet	Best cost/performance ratio
Complex architecture	Opus	Deeper reasoning needed
Simple/quick tasks	Haiku	Up to 18x cheaper than Opus

Default to Sonnet. Escalate to Opus only for genuinely complex problems. Use Haiku for simple tasks, tests, and searches.
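To get a feel for what the "up to 18x" figure means in practice, the sketch below estimates relative spend when a share of Opus work moves to Haiku. It assumes the 18x ratio holds and that moved tasks use the same number of tokens on either model; both are simplifications, and the function name is illustrative.

```python
OPUS_TO_HAIKU_COST_RATIO = 18  # "up to 18x" figure from the table above


def relative_cost_after_shift(fraction_moved: float,
                              ratio: float = OPUS_TO_HAIKU_COST_RATIO) -> float:
    """Spend relative to an all-Opus baseline (1.0) after moving a share to Haiku."""
    if not 0.0 <= fraction_moved <= 1.0:
        raise ValueError("fraction_moved must be between 0 and 1")
    # The share left on Opus costs the same; the moved share costs 1/ratio as much.
    return (1 - fraction_moved) + fraction_moved / ratio
```

Under these assumptions, moving 30% of the work gives `0.7 + 0.3 / 18`, roughly 0.717 of the original spend.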

Action Items

Delegate verbose tasks to subagents
Audit active MCPs: disable unused ones
Use Sonnet as default, Haiku for simple tasks

How to Measure

Track these metrics to verify optimizations are working:

Tokens per task: check with /context before and after changes
Files read per task: fewer files read = less noise
Lines processed: compare monolithic vs modular reads
Time per response: faster responses indicate less processing

Our Experiment Results

Scenario	Monolithic	Modular	Difference
Bug fix (focused)	49,466 tokens	40,447 tokens	-18.2%
Lines processed (bug fix)	814 lines	67 lines	-92% noise
New feature (cross-cutting)	50,350 tokens	50,949 tokens	+1.2%
Refactor (cross-cutting)	49,687 tokens	51,699 tokens	+4.1%

Key insight: Focused tasks (80% of daily work) benefit enormously from modular code. Cross-cutting tasks show minimal difference at small scale but modular wins decisively at 5,000+ lines.

For the complete experiment methodology and raw data, see references/metrics-report.md

Action Items

Baseline your current token usage with /context
Apply the Quick Wins Checklist from the top
Re-measure after changes to quantify improvement
