Token Guard 💰
Stop burning money while you sleep.
Your AI agent runs 24/7. Opus costs $75/M output tokens. One overnight coding session can burn $50+ before you wake up. Token Guard watches your spend and acts before your wallet notices.
The Problem
Real costs from real users:
| Scenario | Model | Cost |
|---|---|---|
| "Let the agent run overnight" | Opus | $50-100 |
| "Register an email + X account" | Opus | $55 (one task!) |
| First week exploring | Opus | $100+ |
| Daily heartbeat checks | Opus | $2-5/day for nothing |
Most users don't realize they're burning Opus-level tokens on tasks that Haiku could handle.
What Token Guard Does
Monitor
- Track token usage per day/model
- Estimate costs from model pricing
- Show usage history and trends
Budget
- Set daily spending limits
- Warning alerts at a configurable threshold (default 80%)
- Hard limits with auto-model-downgrade
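The budget rules above can be sketched as a small decision function. This is an illustration only, not the logic in `scripts/token-guard.sh`: the function name `budget_action`, its argument order, and the strings it prints are all hypothetical, and the day's spend is simply passed in as a dollar amount.

```bash
#!/usr/bin/env bash
# Hypothetical sketch of the budget decision; not the actual token-guard.sh code.
# Usage: budget_action SPENT BUDGET [WARN_PCT] [HARD_LIMIT]
# Prints one of: ok, warn, over, downgrade
budget_action() {
  local spent="$1" budget="$2" warn_pct="${3:-80}" hard="${4:-false}"
  awk -v s="$spent" -v b="$budget" -v w="$warn_pct" -v h="$hard" 'BEGIN {
    if (s > b)                  { print (h == "true" ? "downgrade" : "over") }  # budget exceeded
    else if (s >= b * w / 100)  { print "warn" }                                # past warning threshold
    else                        { print "ok" }
  }'
}

budget_action 4.20 10            # ok
budget_action 8.50 10            # warn
budget_action 12.00 10 80 true   # downgrade
```

Keeping the decision separate from the action (alerting, switching models) makes the thresholds easy to test on their own.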
Optimize
- Model cost comparison at a glance
- Routing recommendations (Opus for reasoning, Haiku for background)
- Per-task cost estimation before running
Quick Start
# See current usage and model costs
bash scripts/token-guard.sh status
# Set a $10/day budget with warnings
bash scripts/token-guard.sh set-budget 10
# Set a hard limit: $10/day, warn at 80%, auto-downgrade to Haiku when exceeded
bash scripts/token-guard.sh set-budget 10 80 true claude-haiku-3-5
# Estimate cost before running a task
bash scripts/token-guard.sh estimate claude-opus-4 50000 10000 10
# Check usage history
bash scripts/token-guard.sh history 7
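For a sense of the arithmetic behind a pre-run estimate, here is a minimal sketch. It assumes the three numbers passed to `estimate` are input tokens per call, output tokens per call, and the number of calls; that reading is a guess, and the helper `estimate_cost` is hypothetical rather than part of the script. Prices are taken as USD per 1M tokens.

```bash
#!/usr/bin/env bash
# Hypothetical sketch of a cost estimate; the real `estimate` subcommand may differ.
# Usage: estimate_cost INPUT_PRICE OUTPUT_PRICE INPUT_TOKENS OUTPUT_TOKENS [CALLS]
# Prices are USD per 1M tokens; token counts are per call.
estimate_cost() {
  local in_price="$1" out_price="$2" in_tok="$3" out_tok="$4" calls="${5:-1}"
  awk -v ip="$in_price" -v op="$out_price" -v it="$in_tok" -v ot="$out_tok" -v n="$calls" \
    'BEGIN { printf "%.2f\n", n * (it * ip + ot * op) / 1000000 }'
}

# Opus at $15 input / $75 output: 50k in + 10k out per call, 10 calls
estimate_cost 15.00 75.00 50000 10000 10   # 15.00
# Same workload on Haiku at $0.80 / $4.00
estimate_cost 0.80 4.00 50000 10000 10     # 0.80
```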
Model Cost Comparison
| Model | Input $/1M | Output $/1M | Relative output cost (Opus = 100x) |
|---|---|---|---|
| Claude Opus 4 | $15.00 | $75.00 | 100x |
| Claude Sonnet 4 | $3.00 | $15.00 | 20x |
| Claude Haiku 3.5 | $0.80 | $4.00 | 5x |
| GPT-4o | $2.50 | $10.00 | 13x |
| Gemini Flash | $0.075 | $0.30 | 0.4x |
| DeepSeek | $0.27 | $1.10 | 1.5x |
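The relative figures in the table appear to track each model's output price scaled so that Opus equals 100. That reading is an inference from the numbers, and the helper below (`relative_cost`, a hypothetical name) is only a sketch for recomputing the column when prices change.

```bash
#!/usr/bin/env bash
# Assumption: "relative" = output price scaled so that Opus ($75 per 1M) is 100.
# Usage: relative_cost OUTPUT_PRICE [BASELINE_PRICE]
relative_cost() {
  awk -v p="$1" -v base="${2:-75}" 'BEGIN { printf "%.1f\n", 100 * p / base }'
}

relative_cost 75.00   # 100.0 (Opus)
relative_cost 15.00   # 20.0  (Sonnet)
relative_cost 4.00    # 5.3   (Haiku)
relative_cost 0.30    # 0.4   (Gemini Flash)
```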
Recommended Routing
Tier 1 (Reasoning/Creative): claude-opus-4 → Use sparingly
Tier 2 (Daily work): claude-sonnet-4 → Primary model
Tier 3 (Background/Subagent): claude-haiku-3-5 → Subagent model
Tier 4 (Bulk/Heartbeat): gemini-2.0-flash → Heartbeat/cron
Savings: Routing background tasks to Haiku instead of Opus cuts the cost of those tasks by about 95% ($0.80/$4.00 per 1M input/output tokens versus $15.00/$75.00).
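The tiers above can be expressed as a simple lookup. The sketch below is not part of Token Guard: `pick_model` and its tier keywords are hypothetical, and falling back to the primary model for unknown tiers is an assumption.

```bash
#!/usr/bin/env bash
# Hypothetical helper mapping a task tier to the model names listed above.
# Usage: pick_model TIER
pick_model() {
  case "$1" in
    reasoning|creative)   echo "claude-opus-4" ;;      # Tier 1: use sparingly
    daily)                echo "claude-sonnet-4" ;;    # Tier 2: primary model
    background|subagent)  echo "claude-haiku-3-5" ;;   # Tier 3: subagent model
    bulk|heartbeat|cron)  echo "gemini-2.0-flash" ;;   # Tier 4: heartbeat/cron
    *)                    echo "claude-sonnet-4" ;;    # assumed default: the primary model
  esac
}

pick_model heartbeat   # gemini-2.0-flash
pick_model reasoning   # claude-opus-4
```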
For AI Agents
Add to your heartbeat or cron:
bash /path/to/token-guard/scripts/token-guard.sh monitor
When budget is exceeded with hard limit enabled, the script automatically downgrades the primary model.
Install
clawdhub install token-guard
# or: git clone https://github.com/jzOcb/token-guard
Requirements
- bash 4+
- python3
- curl
Related
- config-guard: Config validation and auto-rollback
- upgrade-guard: Safe upgrades with snapshot and auto-rollback
- agent-guardrails: Code-level enforcement for AI agents