TokenTamer — LLM Plugin by borhen68

by borhen68 · LLM Plugin · ★ 112

Indexed by AgentSkillsHub · Auto-synced every 8h

About TokenTamer

🚀 TokenTamer

A drop-in proxy that compresses bloated code context in real time, cutting LLM API costs by 50–80% on plain-chat coding agents.

TokenTamer is a middleware proxy that sits between an AI coding agent and the LLM API. It intercepts raw payloads, parses code into an abstract syntax tree (AST), and replaces "background" files with structural skeletons. The agent still sees signatures, classes, and imports; it just stops paying for function bodies it isn't editing.

⚠️ Alpha software. This is a real project in active development, not a polished SaaS. Please read the support matrix below before installing.

🧪 Support Status

Stale file reads get skeletonized; the latest read stays intact.
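To make the skeleton idea concrete, here is a minimal sketch using Python's standard `ast` module (Python 3.9+ for `ast.unparse`). It is not TokenTamer's actual implementation: the function names `skeletonize` and `compress_reads`, and the `(path, text)` tuple shape used for file reads, are hypothetical and chosen only for illustration. It shows function bodies being replaced by `...` while imports, classes, signatures, and docstrings survive, and it keeps only the most recent read of each file intact.

```python
import ast


def skeletonize(source: str) -> str:
    """Keep imports, classes, and signatures; replace function bodies with `...`."""
    try:
        tree = ast.parse(source)
    except SyntaxError:
        return source  # not parseable as Python: pass it through unchanged
    for node in ast.walk(tree):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            stub = ast.Expr(value=ast.Constant(value=...))
            first = node.body[0]
            has_doc = (
                isinstance(first, ast.Expr)
                and isinstance(first.value, ast.Constant)
                and isinstance(first.value.value, str)
            )
            # Keep the docstring (if any), drop everything else in the body.
            node.body = [first, stub] if has_doc else [stub]
    return ast.unparse(tree)


def compress_reads(reads: list[tuple[str, str]]) -> list[tuple[str, str]]:
    """Skeletonize every read of a path except the most recent one.

    `reads` is a hypothetical list of (path, text) pairs in chronological order.
    """
    last_index = {path: i for i, (path, _) in enumerate(reads)}
    return [
        (path, text if last_index[path] == i else skeletonize(text))
        for i, (path, text) in enumerate(reads)
    ]


SAMPLE = '''import os

class Greeter:
    def greet(self, name: str) -> str:
        """Return a greeting."""
        message = "Hello, " + name
        return message
'''

skeleton = skeletonize(SAMPLE)
compressed = compress_reads(
    [("a.py", SAMPLE), ("a.py", SAMPLE), ("b.txt", "not python {")]
)
print(skeleton)
```

In this sketch the first read of `a.py` is reduced to its skeleton, the second (latest) read is returned unchanged, and text that does not parse as Python is passed through untouched. A real proxy would also need to locate file contents inside provider-specific request payloads, which this example does not attempt.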

ai-coding-agent · anthropic · context-compression · cost-reduction · developer-tools · llm · openai · proxy · python · token-optimization

Quick Facts

Stars: 112
Forks: 19
Language: Python
Category: LLM Plugin
License: MIT
Quality Score: 56.562/100
Open Issues: 2
Last Updated: 2026-06-15
Created: 2026-06-08
Platforms: python
Est. Tokens: ~14k

Compatible Skills

These tools work well together with TokenTamer for enhanced workflows:

  • claw-compactor — semantic(0.45)+complementary+rare_topics+same_lang+shared_platform (60%)
  • cc-harness-skills — semantic(0.21)+complementary+rare_topics+same_lang+similar_pop+shared_platform (57%)

TokenTamer alternative? Top 6 similar tools

Looking for a TokenTamer alternative? If you're comparing TokenTamer with other LLM Plugin tools, these 6 projects are the closest alternatives on Agent Skills Hub, ranked by topic overlap, star count, and community traction.

  • llmtrim by fkiene · ⭐ 85

    Local proxy that compresses your LLM API requests so you pay less, with no change to the answers. Trims wasted

  • prompt-caching by flightlesstux · ⭐ 123

    Automatic prompt caching for Claude Code. Cuts token costs by up to 90% on repeated file reads, bug fix sessio

  • anthropic-max-router by nsxdavid · ⭐ 52

    Dual API router (Anthropic + OpenAI compatible) for Claude MAX Plan - Use flat-rate billing with ANY AI too

  • entroly by juyterman1000 · ⭐ 416

    Cut your Claude / OpenAI / Gemini bill 70–95% on AI coding. Local proxy that compresses context, keeps provide

  • NadirClaw by doramirdor · ⭐ 336

    Open-source LLM router & AI cost optimizer. Routes simple prompts to cheap/local models, complex ones to premi

  • LAP by Lap-Platform · ⭐ 212

    Your agents are guessing at APIs. Give them the actual Agent-Native spec. 1500+ API's Ready To-Use skills, Co

Frequently Asked Questions

What is TokenTamer?

TokenTamer is a drop-in proxy that compresses bloated code context in real time, cutting LLM API costs by 50–80% without losing what the model actually needs to know. It is categorized as an LLM Plugin with 112 GitHub stars.

What programming language is TokenTamer written in?

TokenTamer is primarily written in Python. It covers topics such as ai-coding-agent, anthropic, context-compression.

How do I install or use TokenTamer?

You can find installation instructions and usage details in the TokenTamer GitHub repository at github.com/borhen68/TokenTamer. The project has 112 stars and 19 forks, indicating an active community.

What license does TokenTamer use?

TokenTamer is released under the MIT license, making it free to use and modify according to the license terms.

What are the best alternatives to TokenTamer?

The top alternatives to TokenTamer on Agent Skills Hub include llmtrim, prompt-caching, and anthropic-max-router. Each takes a different approach to the same problem space; compare them side by side by stars, quality score, and community activity.
