llmtrim — MCP Server by fkiene

by fkiene · MCP Server · ★ 85

Indexed by AgentSkillsHub · Auto-synced every 8h

About llmtrim

llmtrim is a local proxy that compresses your LLM API requests so you pay less, with no change to the answers. It sits between your AI tools and the provider, strips the wasted tokens out of every request, and forwards it on. You get the same answers for a smaller bill: −31% input and −74% output tokens, measured live across 112 A/B cases, with no change in answer quality.
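To make the quoted figures concrete, here is a minimal sketch of the arithmetic, assuming the −31% input and −74% output reductions apply uniformly to a workload. The function name `estimate_savings` and the per-million-token prices are hypothetical placeholders for illustration; they are not part of llmtrim or of any provider's price list.

```python
def estimate_savings(input_tokens, output_tokens,
                     input_price_per_m, output_price_per_m,
                     input_reduction=0.31, output_reduction=0.74):
    """Return (baseline_cost, trimmed_cost, fraction_saved).

    Prices are per one million tokens. The default reductions are the
    headline figures quoted in the description; real savings will vary.
    """
    def cost(n_in, n_out):
        return (n_in * input_price_per_m + n_out * output_price_per_m) / 1_000_000

    baseline = cost(input_tokens, output_tokens)
    trimmed = cost(input_tokens * (1 - input_reduction),
                   output_tokens * (1 - output_reduction))
    # Guard against an empty workload to avoid dividing by zero.
    saved = 0.0 if baseline == 0 else 1 - trimmed / baseline
    return baseline, trimmed, saved


# Hypothetical workload: 10M input tokens, 1M output tokens,
# at placeholder prices of 3.00 and 15.00 per million tokens.
baseline, trimmed, saved = estimate_savings(10_000_000, 1_000_000, 3.00, 15.00)
print(f"baseline={baseline:.2f} trimmed={trimmed:.2f} saved={saved:.1%}")
```

With these placeholder numbers the bill drops from about 45 to about 24.6, roughly 45% lower. The overall saving depends on the mix of input and output tokens, so an input-heavy workload will land closer to 31% and an output-heavy one closer to 74%.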

agentic-coding · ai · anthropic · claude-code · cost-reduction · developer-tools · llm · llmops · mcp · mitm-proxy

Quick Facts

Stars: 85
Forks: 5
Language: Rust
Category: MCP Server
License: MPL-2.0
Quality Score: 50.726/100
Open Issues: 2
Last Updated: 2026-06-20
Created: 2026-06-07
Platforms: claude-code, mcp, rust
Est. Tokens: ~27k

Compatible Skills

These tools work well together with llmtrim for enhanced workflows:

  • homebrew-pandafilter: 58% match (semantic similarity 0.23; complementary, rare topics, same language, similar popularity, shared platform)
  • crab-code: 53% match (semantic similarity 0.22; complementary, same language, similar popularity, shared platform)

llmtrim alternative? Top 6 similar tools

If you're comparing llmtrim with other MCP Server tools, these 6 projects are the closest alternatives on Agent Skills Hub, selected by topic overlap, star count, and community traction.

  • headroom-desktop by gglucass · ⭐ 194

    Unlock 2x more Claude Code and Codex usage

  • prompt-caching by flightlesstux · ⭐ 123

    Automatic prompt caching for Claude Code. Cuts token costs by up to 90% on repeated file reads, bug fix sessio

  • TokenTamer by borhen68 · ⭐ 112

    A drop-in proxy that compresses bloated code context in real-time, cutting LLM API costs by 50–80% without los

  • homebrew-pandafilter by AssafWoo · ⭐ 99

    The context intelligence layer for AI coding agents. Compressing noise, routing content to the right strategy,

  • ccproxy by starbaser · ⭐ 351

    Build mods for Claude Code: Hook any request, modify any response, /model "with-your-custom-model", intelligen

  • claude-emporium by Vvkmnn · ⭐ 274

    🏛 [UNDER CONSTRUCTION] A (roman) claude plugin marketplace

Frequently Asked Questions

What is llmtrim?

llmtrim is a local proxy that compresses your LLM API requests so you pay less, with no change to the answers. It trims wasted tokens from prompts, history, tool output, and code before they're sent: -31% input / -74% output tokens. It is categorized as an MCP Server and has 85 GitHub stars.

What programming language is llmtrim written in?

llmtrim is primarily written in Rust. It covers topics such as agentic-coding, ai, anthropic.

How do I install or use llmtrim?

You can find installation instructions and usage details in the llmtrim GitHub repository at github.com/fkiene/llmtrim. The project has 85 stars and 5 forks.

What license does llmtrim use?

llmtrim is released under the MPL-2.0 license, making it free to use and modify according to the license terms.

What are the best alternatives to llmtrim?

The top alternatives to llmtrim on Agent Skills Hub include headroom-desktop, prompt-caching, and TokenTamer. Each takes a different approach to the same problem space; compare them side by side by stars, quality score, and community activity.
