Agent token use optimiser

Use
less
tokens.

Tokenless is a six-phase pipeline that gives AI agents 60–92% token savings on a wide range of agentic tasks — without losing the information that matters.

Live savings estimate Running
Loaded
0%
Avoided
0%
0
tokens avoided
0%
avoidance rate
SCAN
LOCATE
ZOOM
EXTRACT
ACT
DISCARD
How it works

Six phases.
One goal.

Every call runs the same deterministic pipeline. Token budgets are enforced at each phase — no overruns, no surprises.

01
Scan
5% of budget
Detect content type. Build a ranked region index without loading the full document.
02
Locate
3% of budget
Rank regions by relevance to your query. Deduplicate overlapping sections.
03
Zoom
35% of budget
Extract windowed content from top-ranked regions only. The rest is never read.
04
Extract
12% of budget
Parse fields, bullets, and tables from zoomed content into structured output.
05
Act
45% of budget
Your agent takes action using the compact result — not the raw document.
06
Discard
~0
Raw content is collapsed from session memory. Nothing lingers in context.
Benchmarks

Real numbers.
Real documents.

Content Type Input Tokens Avoidance
0%
average token avoidance across all content types
10
content types auto-detected — no configuration required
100%
recall across all benchmark fixtures — information preserved
Download

Up in sixty
seconds.

Terminal
curl -fsSL https://tokenless.app/install.sh | sh
1
Run the install command — the binary downloads and is added to your PATH automatically.
2
Run tokenless init --global to register the MCP server with Claude Code.
3
Start a session. Your agent now reads at 60–92% fewer tokens with no code changes.