mirror of https://github.com/luongnv89/claude-howto.git synced 2026-06-05 22:36:34 +02:00

Files

T

Luong NGUYEN 0fcac18357 docs: Add blog post and new slash commands for development workflow

- Add blog post: 4 Essential Slash Commands I Use in Every Project
- Add new slash commands: /doc-refactor, /setup-ci-cd, /unit-test-expand
- Update slash-commands README with comprehensive documentation
- Simplify /push-all command structure
- Archive add-blog-post-slash-commands change
- Add blog-post spec and pending openspec changes

2025-12-26 11:02:19 +01:00

3.7 KiB

Raw Blame History

Change: Add Context Usage Tracking Documentation via Pre-Message and Post-Response Hooks

Why

Users want to monitor token consumption per request and overall context usage throughout a Claude Code session. Currently, the hooks documentation shows a basic context-usage example using the Stop hook, but it doesn't demonstrate how to track per-request token consumption by comparing measurements at two points in time.

By documenting how to use UserPromptSubmit as a "pre-message" hook and Stop as a "post-response" hook, users can calculate the delta in token usage for each request, enabling accurate per-request consumption metrics.

What Changes

ADDED: Documentation for using UserPromptSubmit and Stop hooks together for context tracking
ADDED: A new example demonstrating token delta calculation between pre-message and post-response
MODIFIED: Enhance the existing context usage reporting requirement with delta-based tracking approach
ADDED: Detailed explanation of token estimation methodology and its limitations

Impact

Affected specs: hooks-documentation
Affected code: 06-hooks/README.md (documentation updates)
No breaking changes - purely additive documentation

Technical Analysis

Current Hook Events Mapping

Desired Hook	Claude Code Event	Trigger Point
Pre-Message Hook	`UserPromptSubmit`	Before user prompt is processed by the model
Post-Response Hook	`Stop`	After model completes its full response

Token Counting Methods (Offline, No API Key)

Since we need offline token counting without an API key, we offer two local approaches:

Method 1: tiktoken with p50k_base (More Accurate)

Use OpenAI's tiktoken library with the p50k_base encoding as an approximation for Claude's tokenizer:

import tiktoken

enc = tiktoken.get_encoding("p50k_base")
tokens = enc.encode(text)
token_count = len(tokens)

Pros:

More accurate than character estimation (~90-95% accuracy)
Works completely offline
No API key required
Fast execution

Cons:

Requires tiktoken dependency (pip install tiktoken)
Not Claude's exact tokenizer (approximation)

Method 2: Character-Based Estimation (Simplest)

For zero-dependency estimation:

estimated_tokens = len(text) // 4