Drop your tool-schema token overhead instantly. Promptolian compresses context, detects stuck loops, routes to the right model, and proxies to DeepSeek or GLM when Opus is overkill.
Published on Dev.to: Everyone compresses their agent's context. Nobody measures what it forgets.
Six persistent, expensive problems. Fixed transparently, with no code changes required in your agent.
Every API call re-sends the full JSON tool definitions. 5 tools = ~600 tokens wasted per call. This happens silently on every request, across every session.
Built-in compression (Anthropic/OpenAI) scores 3.44/5: it loses facts to save tokens. LLM summarisers write "database connection was discussed" instead of keeping postgres://db.prod/main.
Agentic workflows read the same files, run the same bash commands, and fetch the same API responses across multiple turns. Each repeat costs full tokens.
Running DS4 locally is free: but a 284B model at 26 tokens/sec hits the 128K context wall in ~15 agentic turns. Thinking tokens from DeepSeek's reasoning mode accumulate silently, eating 2000+ tokens per turn. And when the session dies, the agent forgets everything.
--upstream + thinking compression + working memoryAn agent that cannot find a file will try again. And again. And again. Each failed attempt costs tokens, pollutes the context, and brings you closer to the context wall: without making any progress. Most frameworks have no mechanism to detect or break this.
Most agent calls don't need Claude Opus. Simple lookups and drafting tasks get routed to the same expensive model as your hardest reasoning problems. And when you want cheaper alternatives like DeepSeek, switching SDKs breaks your whole stack.
Factory.ai 6-dimension probe scoring · 25 sessions · May 2026
| System | Quality | Compression | Fact-loss rate |
|---|---|---|---|
| Promptolian ✦ |
4.26 / 5
|
22% | 14.8% |
| Anthropic built-in |
3.44 / 5
|
98.7% | 31.2% |
| OpenAI built-in |
3.35 / 5
|
99.3% | 33.0% |
Tool deduplication and schema caching are free. KV-sandwich context compression requires an API key.
--upstream)Full pricing details, ROI calculator → pricing.html
pip install "promptolian[proxy]"python -m promptolian.proxybase_url to http://localhost:3002