by fkiene · MCP Server · ★ 85
Last updated: · Indexed by AgentSkillsHub · Auto-synced every 8h
llmtrim llmtrim is a local proxy that compresses your LLM API requests so you pay less, with no change to the answers. It sits between your AI tools and the provider, strips the wasted tokens out of every request, and forwards it on. You get the same answers for a smaller bill. −31% input and −74% output tokens, measured live across 112 A/B cases, with no change in answer quality. What it does • See it in action • Get started • CLI & library • Works with • Configuration • Numbers What it actually do
| Stars | 85 |
| Forks | 5 |
| Language | Rust |
| Category | MCP Server |
| License | MPL-2.0 |
| Quality Score | 50.726/100 |
| Open Issues | 2 |
| Last Updated | 2026-06-20 |
| Created | 2026-06-07 |
| Platforms | claude-code, mcp, rust |
| Est. Tokens | ~27k |
These tools work well together with llmtrim for enhanced workflows:
Looking for a llmtrim alternative? If you're comparing llmtrim with other mcp server tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.
Unlock 2x more Claude Code and Codex usage
Automatic prompt caching for Claude Code. Cuts token costs by up to 90% on repeated file reads, bug fix sessio
A drop-in proxy that compresses bloated code context in real-time, cutting LLM API costs by 50–80% without los
The context intelligence layer for AI coding agents. Compressing noise, routing content to the right strategy,
Build mods for Claude Code: Hook any request, modify any response, /model "with-your-custom-model", intelligen
🏛 [UNDER CONSTRUCTION] A (roman) claude plugin marketplace
Explore other popular mcp server tools:
llmtrim is Local proxy that compresses your LLM API requests so you pay less, with no change to the answers. Trims wasted tokens from prompts, history, tool output, and code before they're sent: -31% input / -74. It is categorized as a MCP Server with 85 GitHub stars.
llmtrim is primarily written in Rust. It covers topics such as agentic-coding, ai, anthropic.
You can find installation instructions and usage details in the llmtrim GitHub repository at github.com/fkiene/llmtrim. The project has 85 stars and 5 forks, indicating an active community.
llmtrim is released under the MPL-2.0 license, making it free to use and modify according to the license terms.
The top alternatives to llmtrim on Agent Skills Hub include headroom-desktop, prompt-caching, TokenTamer. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.