Token Optimization Guide

Version: 2.1.0 Last Updated: 2026-02-03 Purpose: Reduce token usage through official and experimental methods

Official Token Optimization Methods

These methods are officially supported by Claude Code and recommended for all users:

1. Use `/clear` Between Unrelated Tasks

Reset conversation context when switching tasks:

/clear

This prevents context accumulation from previous tasks.

2. Move Domain Knowledge to Skills

Skills (.claude/skills/) load on-demand when invoked:

.claude/skills/
├── coding-guidelines/SKILL.md    # Loads when invoked
├── security-audit/SKILL.md       # Loads when invoked
└── api-design/SKILL.md           # Loads when invoked

3. Delegate to Subagents

Subagents run with separate context, reducing main conversation load:

Use the Task tool to investigate the authentication system.

4. Offload Validation to Hooks

Move repetitive validation to hooks instead of conversation context:

{
  "hooks": {
    "PreToolUse": [{
      "matcher": "Bash",
      "hooks": [{ "type": "command", "command": "validate-command.sh" }]
    }]
  }
}

5. Write Specific Prompts

Specific prompts require less context disambiguation:

# Good - Specific
Fix the null pointer exception in src/auth/login.ts:42

# Bad - Vague (requires more context)
Fix the bug in the login system

6. Use `permissions.deny` for Security

Block access to unnecessary files via official settings:

{
  "permissions": {
    "deny": [
      "Read(**/node_modules/**)",
      "Read(**/dist/**)",
      "Read(**/build/**)"
    ]
  }
}

Experimental Methods (Not Official)

The following methods use features that are NOT officially supported:

Important Notice: .claudeignore Status

WARNING: .claudeignore is NOT an official Claude Code feature.

GitHub Issue: #579 - .claudeignore feature request
Status: Feature is still being developed
Behavior: May work in some versions but is not guaranteed

Official Alternative: permissions.deny

For security-related file exclusions, use the official permissions.deny in settings.json:

{
  "permissions": {
    "deny": [
      "Read(./.env)",
      "Read(**/secrets/**)",
      "Read(**/*.pem)",
      "Read(**/*.key)"
    ]
  }
}

Note: permissions.deny is designed for security (blocking access to sensitive files), not for token optimization. Files blocked by permissions.deny are excluded from file discovery and search results.

Migration Strategy

This project has migrated security-related exclusions to settings.json:

global/settings.json - Global security rules
project/.claude/settings.json - Project-specific security rules

.claudeignore files are now deprecated but retained for potential token optimization if the feature becomes official.

Overview

This guide explains how claude-config optimizes token usage through .claudeignore files (deprecated) and permissions.deny (official), reducing initial session costs from ~50,000 tokens to ~15,000-20,000 tokens.

Problem Statement

Before Optimization

Without .claudeignore files, Claude Code loads:

Category	Token Usage	Needed?
Core configuration	~5,000	✅ Always
Reference documents	~18,000	❌ On demand only
.npm-cache	~8,000	❌ Duplicate content
Plugin marketplace	~15,000	❌ Rarely used
Session memory	~5,000	❌ Past conversations
Commands/Skills	~10,000	❌ Load when invoked
Total	~61,000	Only ~8% needed

Impact

High API costs: Every session wastes ~46,000 tokens ($0.046-$0.138 per session)
Slow startup: Large context takes longer to process
Context pollution: Rarely-used content occupies valuable context window
No on-demand loading: Reference materials loaded even when unused

Solution: Layered Approach

Strategy

This project uses a layered approach for file exclusion:

permissions.deny (Official): Security-focused file blocking
- Blocks access to sensitive files (.env, secrets, credentials)
- Officially supported by Claude Code
- Prevents both reading and context inclusion
.claudeignore (Deprecated/Experimental): Token optimization
- May reduce token usage by excluding large files
- Not officially supported - behavior may change
- Retained for potential future support

What to Exclude

Reference documents: Load only when explicitly needed
Cache directories: Exclude duplicate/generated content
Plugin marketplace: Very large, rarely accessed
Session memory: Past conversations not needed in new sessions
Commands/Skills: Auto-loaded when invoked

Implementation

Three .claudeignore files optimize different scopes:

1. Global Configuration (`global/.claudeignore`)

Excludes user-specific content:

# Session memory (past conversations)
projects/*/session-memory/

# Plugin cache
plugins/cache/

# Backups
backup_*/

# Plugin marketplace (very large)
plugins/marketplaces/

# Plans
plans/

# Commands (load when invoked)
commands/

# Skills (load when invoked)
skills/

Token savings: ~37,000 tokens (65%)

2. Project Configuration (`project/.claudeignore`)

Excludes project-specific unnecessary content:

# NPM cache (duplicate files)
.npm-cache/

# Reference documents (load on demand)
rules/*/reference/
skills/*/reference/

# Agents (load when needed)
agents/

# Settings files (user-specific)
settings.json
settings.local.json

# Documentation files
README.md
README.ko.md

Token savings: ~38,500 tokens (68%)

3. Plugin Configuration (`plugin/.claudeignore`)

Excludes plugin development artifacts:

# NPM cache
.npm-cache/

# Node modules
node_modules/

# Build outputs
dist/
build/

# Settings
settings.json
.env

# Documentation
README.md
docs/

Token savings: ~17,500 tokens (60%)

After Optimization

Token Usage Breakdown

Category	Before	After	Savings
Core configuration	~5,000	~5,000	0
Reference documents	~18,000	0	18,000
.npm-cache	~8,000	0	8,000
Plugin marketplace	~15,000	0	15,000
Session memory	~5,000	0	5,000
Commands/Skills	~10,000	0	10,000
Total	~61,000	~5,000	~56,000 (92%)

Real-World Results

Measured on claude-config project (2026-03-21):

Metric	Before (no frontmatter)	After (`alwaysApply: false`)	Improvement
Always-loaded rules	11 files, ~105KB (~26,200 tokens)	5 files, ~4KB (~1,010 tokens)	96% reduction
Initial load (rules + config)	~122KB (~30,500 tokens)	~17KB (~4,300 tokens)	86% reduction
Conditional rules	0 files	28 files (~208KB, on demand)	Loaded only when relevant

Key finding: paths frontmatter alone does NOT prevent loading. alwaysApply: false is required alongside paths for conditional loading to work.

Using Reference Documents

Reference documents are excluded by default but easily accessible when needed.

Method 1: Explicit File Path

Can you review the label definitions in rules/workflow/reference/label-definitions.md?

Claude Code will load the specific file when you reference it.

Method 2: @load Directive

@load: reference/label-definitions, reference/automation-patterns

Help me set up issue labels.

Load multiple reference files at once.

Method 3: Ask to Load

I need help with GitHub issue labeling. Please load the relevant reference documentation.

Claude Code will identify and load appropriate reference files.

File-by-File Token Estimation

Reference Documents

File	Tokens	Purpose
`rules/workflow/reference/label-definitions.md`	~4,000	GitHub label standards
`rules/workflow/reference/automation-patterns.md`	~6,000	GitHub Actions, gh CLI
`rules/workflow/reference/issue-examples.md`	~8,000	Issue splitting, templates
Total	~18,000	Load on demand only

Cache Directories

Directory	Tokens	Why Excluded
`.npm-cache/`	~8,000	Duplicate package files
`plugins/cache/`	~3,000	Temporary plugin data
Total	~11,000	Unnecessary duplicates

Plugin Marketplace

Component	Tokens	Why Excluded
Official plugins	~10,000	Large, rarely accessed
External plugins	~5,000	Third-party code
Total	~15,000	Not needed in sessions

Rollback Instructions

If you encounter issues or need all content loaded:

Option 1: Modify permissions.deny

Edit settings.json to remove or comment out specific deny rules:

{
  "permissions": {
    "deny": [
      // "Read(./.env)"  // Commented out to allow access
    ]
  }
}

Option 2: Delete .claudeignore Files (Deprecated)

rm global/.claudeignore
rm project/.claudeignore
rm plugin/.claudeignore

Restart your session - all content will load.

Option 2: Temporary Override

For a single session, rename the files:

mv global/.claudeignore global/.claudeignore.bak

Restore after session:

mv global/.claudeignore.bak global/.claudeignore

Option 3: Selective Re-enabling

Comment out specific patterns in .claudeignore:

# Temporarily re-enable reference docs
# rules/*/reference/

Troubleshooting

Issue: Missing Information

Symptom: Claude Code doesn't have context it needs.

Solution: Reference the specific file or use @load directive:

@load: reference/label-definitions

Help me with GitHub labels.

Issue: Slower Responses

Symptom: Claude Code takes longer to respond.

Solution: You may have loaded too much content. Restart session to reset context.

Issue: No Token Savings

Symptom: Token usage still high after installing .claudeignore.

Solution:

Verify .claudeignore files exist: ls -la {global,project,plugin}/.claudeignore
Check file contents for syntax errors
Restart Claude Code session
Use /token-usage to verify current usage

Tier Preset Impact

Skills whose SKILL.md body exceeds 5 KB declare tier presets in their frontmatter. The caller selects a tier at invocation time to dial the context budget up or down for the same workflow, trading reference depth and verification cost against tokens loaded.

Authority: global/skills/_policy.md §Tier Preset Schema defines the canonical field names and semantics. This section reports empirical impact.

Overview

Tier	`ref_docs`	`deep_checks`	`max_files`
`light`	None — SKILL.md body only	`false`	Narrow
`standard`	Baseline reference set	`false`	Default
`deep`	Full reference set	`true`	Expanded

ref_docs keys are skill-defined aliases (core, advanced, batch, etc.) resolved against the skill's own reference/ directory. See _policy.md for the authoritative schema.

Baseline Measurements

Measured against pr-work at full-body load (issue #401 figures):

Tier	Tokens	Delta vs. `standard`
`light`	~1,500	−72%
`standard`	~5,350	baseline
`deep`	~6,530	+22%

Frontmatter tier-selection logic adds a one-time +150–300 tokens per tiered skill, independent of invocation tier.

Weighted session average: −39% token reduction, using the projected invocation mix:

Tier	Share
`light`	40%
`standard`	50%
`deep`	10%

Default Tier Rationale

default_tier: standard preserves existing behavior byte-for-byte. Current workflows need no opt-out — they continue to load the baseline reference set as before. Callers explicitly opt into light for quick, scoped tasks or deep when exhaustive checks are warranted.

Invocation

Pass the tier as a flag when invoking a tiered skill:

/<skill> --tier=light
/<skill> --tier=standard
/<skill> --tier=deep

Omitting --tier selects the skill's declared default_tier. Unknown tier values fall back to default_tier with a warning.

Tiered Skills

pr-work
issue-work
fleet-orchestrator
harness
release

Additional skills will adopt the schema as their bodies cross the 5 KB threshold or as workflow demand justifies differentiated loading.

Follow-up

A/B measurement is scheduled post-rollout (tracked in issue #401):

20 test runs before/after to validate the −39% weighted projection
Dedicated light-tier PR review test case to verify the target token budget
Re-measurement of pr-work and issue-work under realistic workloads

Results will be appended to this section once collected.

Harness Routing Audit

Harness-layer CLAUDE.md files are loaded on every session regardless of task, so procedural content paid for once lives forever in that always-on budget. scripts/validate_skills.sh enforces a routing-only discipline on the three harness files.

Audited files

global/CLAUDE.md
project/CLAUDE.md
enterprise/CLAUDE.md

Classification

For each non-blank, non-heading line outside the footer (first ---):

Line shape	Counted as
`- ...`, `* ...`, `+ ...` (bullet)	routing
`\| ...` (table row)	routing
`@./...` (import directive)	routing
`> ...` (blockquote)	routing
Any line inside a heading whose text matches `[Ii]nvariant`	routing/invariant
Anything else (free-form sentences)	prose

Threshold

Fail the build if prose / (routing + prose) > AUDIT_PROSE_RATIO (default 0.30).

# Override for experimentation
AUDIT_PROSE_RATIO=0.40 ./scripts/validate_skills.sh

Current measurements (post-harness-diet):

File	Prose	Routing/Invariant	Ratio
`global/CLAUDE.md`	2	11	15.4%
`project/CLAUDE.md`	9	26	25.7%
`enterprise/CLAUDE.md`	0	3	0.0%

Intent

Keep short always-on invariants inside an ## Always-on Invariants heading — they count as routing regardless of shape, because the block's purpose is guardrails.
Move procedural prose (how-to, workflow detail) into docs/**, global/skills/**/SKILL.md, or project-level rules/**/*.md.
Prefer a routing bullet that points to the existing file over a prose paragraph that duplicates it.

Related frontmatter drift checks

The same validator flags inconsistency between skill frontmatter contracts and body:

max_iterations or halt_condition declared → body must mention loop|retry|iteration|poll
loop_safe: false declared → body must document side effects / non-idempotent behavior

Failures here are warnings (non-fatal) to avoid blocking incremental skill authoring, but CI surfaces the count.

Best Practices

1. Start Lean

Always start sessions with optimized context:

Core configuration only (~5,000 tokens)
Load reference docs when needed

2. Use @load Sparingly

Only load reference documents when actively using them:

# Good: Load when needed
@load: reference/label-definitions
Help me create issue labels.

# Bad: Load everything upfront
@load: reference/*
Generic question about the project.

3. Regular Cleanup

Periodically remove unnecessary files:

# Remove .npm-cache
rm -rf .npm-cache/

# Remove old backups
rm -rf backup_*/

4. Monitor Token Usage

Track token consumption:

Use /token-usage command
Check Claude Code UI
Review API billing dashboard

Cost Savings Calculator

Individual Developer

Sessions/Month	Before	After	Monthly Savings
50	$6.90	$2.25	$4.65
100	$13.80	$4.50	$9.30
200	$27.60	$9.00	$18.60

Team (5 Developers)

Sessions/Month/Person	Before	After	Monthly Savings
100	$69.00	$22.50	$46.50
200	$138.00	$45.00	$93.00

Enterprise (50 Developers)

Sessions/Month/Person	Before	After	Monthly Savings
200	$1,380.00	$450.00	$930.00
500	$3,450.00	$1,125.00	$2,325.00

Assumes $0.138 per 1,000 input tokens (Claude Sonnet 4.5 pricing)

Implementation Checklist

When installing claude-config:

Run ./bootstrap.sh to install .claudeignore files
Verify .claudeignore files exist in global/, project/, plugin/
Remove existing .npm-cache/ directories
Restart Claude Code session
Verify token reduction using /token-usage
Bookmark this guide for reference document loading

FAQ

Q: Is .claudeignore an official Claude Code feature?

A: No. As of February 2026, .claudeignore is not officially supported. See GitHub Issue #579. Use permissions.deny in settings.json for security-related file exclusions.

Q: What's the difference between permissions.deny and .claudeignore?

permissions.deny: Official feature for security - blocks tool access to sensitive files
.claudeignore: Experimental/unsupported - may reduce token usage but behavior is not guaranteed

Q: Will this break my workflow?

A: No. Core functionality remains unchanged. Reference documents load on-demand when you explicitly reference them.

Q: How do I access excluded content?

A: Use explicit file paths, @load directive, or ask Claude to load relevant documentation.

Q: Can I customize .claudeignore?

A: Yes, but remember it's deprecated. Edit the files to add/remove patterns. Follow gitignore syntax.

Q: What if I need everything loaded?

A: Remove deny rules from settings.json or delete .claudeignore files and restart your session.

Q: Does this affect code generation quality?

A: No. Core rules and guidelines remain loaded. Reference materials load when needed.

Support

For issues or questions:

GitHub Issues: https://github.com/kcenon/claude-config/issues
Documentation: https://github.com/kcenon/claude-config
Discussion: GitHub Discussions

Token optimization: Because every token counts.

FilesExpand file tree

TOKEN_OPTIMIZATION.md

Latest commit

History

TOKEN_OPTIMIZATION.md

File metadata and controls

Token Optimization Guide

Official Token Optimization Methods

1. Use /clear Between Unrelated Tasks

2. Move Domain Knowledge to Skills

3. Delegate to Subagents

4. Offload Validation to Hooks

5. Write Specific Prompts

6. Use permissions.deny for Security

Experimental Methods (Not Official)

Important Notice: .claudeignore Status

Official Alternative: permissions.deny

Migration Strategy

Overview

Problem Statement

Before Optimization

Impact

Solution: Layered Approach

Strategy

What to Exclude

Implementation

1. Global Configuration (global/.claudeignore)

2. Project Configuration (project/.claudeignore)

3. Plugin Configuration (plugin/.claudeignore)

After Optimization

Token Usage Breakdown

Real-World Results

Using Reference Documents

Method 1: Explicit File Path

Method 2: @load Directive

Method 3: Ask to Load

File-by-File Token Estimation

Reference Documents

Cache Directories

Plugin Marketplace

Rollback Instructions

Option 1: Modify permissions.deny

Option 2: Delete .claudeignore Files (Deprecated)

Option 2: Temporary Override

Option 3: Selective Re-enabling

Troubleshooting

Issue: Missing Information

Issue: Slower Responses

Issue: No Token Savings

Tier Preset Impact

Overview

Baseline Measurements

Default Tier Rationale

Invocation

Tiered Skills

Follow-up

Harness Routing Audit

Audited files

Classification

Threshold

Intent

Related frontmatter drift checks

Best Practices

1. Start Lean

2. Use @load Sparingly

3. Regular Cleanup

4. Monitor Token Usage

Cost Savings Calculator

Individual Developer

Team (5 Developers)

Enterprise (50 Developers)

Implementation Checklist

FAQ

Q: Is .claudeignore an official Claude Code feature?

Q: What's the difference between permissions.deny and .claudeignore?

Q: Will this break my workflow?

Q: How do I access excluded content?

Q: Can I customize .claudeignore?

Q: What if I need everything loaded?

Q: Does this affect code generation quality?

1. Use `/clear` Between Unrelated Tasks

6. Use `permissions.deny` for Security

1. Global Configuration (`global/.claudeignore`)

2. Project Configuration (`project/.claudeignore`)

3. Plugin Configuration (`plugin/.claudeignore`)