llm_context_benchmarks — LLM Plugin by ivanfioravanti

by ivanfioravanti · LLM Plugin · ★ 67

Last updated: · Indexed by AgentSkillsHub · Auto-synced every 8h

About llm_context_benchmarks

LLM Context Benchmarks Benchmark prompt-processing and generation throughput across context sizes (0.5k–128k tokens) for many inference engines: Ollama (API & CLI), MLX, MLX Distributed, MLX-VLM, llama.cpp, LM Studio, Exo, Apple Foundation Models Serve, vMLX, oMLX, Paroquant, and any OpenAI-compatible endpoint. Optimized for Apple Silicon but works anywhere Python runs. Installation Engine-specific setup: (Optional) pre-commit hooks for Black + isort: Running Benchmarks bash List engines uv run benchmark --list-engines Generate test files (only needed once) uv run generate-context-files prideandprejudice.txt Run a benchmark (engine + model) uv run benchmark mlx mlx-community/Qwen3-4B-Instruct-2507-4bit uv run

aibenchmarkingllms

Quick Facts

Stars67
Forks9
LanguagePython
CategoryLLM Plugin
LicenseApache-2.0
Quality Score55.668/100
Open Issues4
Last Updated2026-06-13
Created2025-08-06
Platformscli, python
Est. Tokens~15k

llm_context_benchmarks alternative? Top 6 similar tools

Looking for a llm_context_benchmarks alternative? If you're comparing llm_context_benchmarks with other llm plugin tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.

  • weam by weam-ai · ⭐ 216

    Web app for teams of 20+ members. In-built connections to major LLMs via API. Share chats, prompts, and agents

  • log10 by log10-io · ⭐ 96

    Python client library for improving your LLM app accuracy

  • manim-generator by makefinks · ⭐ 85

    Automatic LLM-based video generation using the manim library. Usage of a code-writer and code-reviewer feedbac

  • moonshot by aiverify-foundation · ⭐ 315

    Moonshot - A simple and modular tool to evaluate and red-team any LLM application.

  • claude-code-tamagotchi by Ido-Levi · ⭐ 302

    Real-time behavioral enforcement for Claude Code. Monitors AI actions, detects violations, and interrupts misb

  • bocoel by rentruewang · ⭐ 289

    Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 tim

More LLM Plugin Tools

Explore other popular llm plugin tools:

View all LLM Plugin tools →

Popular Python Agent Tools

Frequently Asked Questions

What is llm_context_benchmarks?

llm_context_benchmarks is 📊 LLM Context Benchmarks - A comprehensive benchmarking tool for testing LLMs with varying context sizes using Ollama. Features dual benchmark modes (API/CLI), automatic hardware detection (optimized. It is categorized as a LLM Plugin with 67 GitHub stars.

What programming language is llm_context_benchmarks written in?

llm_context_benchmarks is primarily written in Python. It covers topics such as ai, benchmarking, llms.

How do I install or use llm_context_benchmarks?

You can find installation instructions and usage details in the llm_context_benchmarks GitHub repository at github.com/ivanfioravanti/llm_context_benchmarks. The project has 67 stars and 9 forks, indicating an active community.

What license does llm_context_benchmarks use?

llm_context_benchmarks is released under the Apache-2.0 license, making it free to use and modify according to the license terms.

What are the best alternatives to llm_context_benchmarks?

The top alternatives to llm_context_benchmarks on Agent Skills Hub include weam, log10, manim-generator. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.

View on GitHub → Browse LLM Plugin tools