1704+ open-source llm plugin tools ranked by stars
LLM Plugin tools are open-source packages that extend AI coding agents like Claude Code, OpenAI Codex, Gemini CLI, and other AI assistants. They provide specialized capabilities ranging from code generation and debugging to API integration and workflow automation.
Agent Skills Hub indexes 1704+ llm plugin tools from GitHub, ranked by community adoption (stars), code quality scores, and compatibility with popular AI agents. The top languages in this category are TypeScript, Python, Rust, Swift, Jupyter Notebook.
| # | Skill | Stars | Lang |
|---|---|---|---|
| 1 | stagehand The SDK For Browser Agents |
★ 23.3k | TypeScript |
| 2 | promptfoo Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare p |
★ 22.8k | TypeScript |
| 3 | gorilla Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) |
★ 12.8k | Python |
| 4 | PocketFlow-Tutorial-Codebase-Knowledge Pocket Flow: Codebase to Tutorial |
★ 12.4k | Python |
| 5 | llm Access large language models from the command-line |
★ 12.1k | Python |
| 6 | llm-engineer-toolkit A curated list of 120+ LLM libraries category wise. |
★ 10.5k | |
| 7 | phoenix AI Observability & Evaluation |
★ 10.3k | Python |
| 8 | code2prompt A CLI tool to convert your codebase into a single LLM prompt with source tree, prompt templating, an |
★ 7.4k | Rust |
| 9 | superagent Superagent protects your AI applications against prompt injections, data leaks, and harmful outputs. |
★ 6.5k | TypeScript |
| 10 | ai-cookbook Examples and tutorials to help developers build AI systems |
★ 4.1k | Python |
| 11 | DecryptPrompt 总结Prompt&LLM论文,开源数据&模型,AIGC应用 |
★ 3.4k | |
| 12 | ax The pretty much "official" DSPy framework for Typescript |
★ 2.8k | TypeScript |
| 13 | yek A fast Rust based tool to serialize text-based files in a repository or directory for LLM consumptio |
★ 2.5k | Rust |
| 14 | code-interpreter Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app |
★ 2.3k | Python |
| 15 | awesome-local-llm A curated list of awesome platforms, tools, practices and resources that helps run LLMs locally |
★ 2.3k | |
| 16 | WritingTools The world's smartest system-wide grammar assistant; a better version of the Apple Intelligence Writi |
★ 2.3k | Swift |
| 17 | any-llm Communicate with an LLM provider using a single interface |
★ 2.1k | Python |
| 18 | ExtractThinker ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexi |
★ 1.6k | Python |
| 19 | awesome-llm-agents A curated list of awesome LLM agents frameworks. |
★ 1.5k | Python |
| 20 | mirascope The LLM Anti-Framework |
★ 1.5k | Python |
| 21 | tau2-bench τ-Bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains |
★ 1.5k | Python |
| 22 | desktop E2B Desktop Sandbox for LLMs. E2B Sandbox with desktop graphical environment that you can connect to |
★ 1.4k | Python |
| 23 | langtrace Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applicat |
★ 1.2k | TypeScript |
| 24 | LLM-ToolMaker |
★ 1.1k | Jupyter Notebook |
| 25 | NyaProxy NyaProxy acts like a smart, central manager for accessing various online services (APIs) – think AI |
★ 959 | Python |
| 26 | ontogpt LLM-based ontological extraction tools, including SPIRES |
★ 907 | Jupyter Notebook |
| 27 | kani kani (カニ) is a highly hackable microframework for tool-calling language models. (NLP-OSS @ EMNLP 202 |
★ 604 | Python |
| 28 | echo The User Pays AI SDK |
★ 539 | TypeScript |
| 29 | LLM-Tool-Survey This is the repository for the Tool Learning survey. |
★ 481 | |
| 30 | vllm-cli A command-line interface tool for serving LLM using vLLM. |
★ 480 | Python |
| 31 | ToolNeuron On-device AI for Android — LLM chat (GGUF/llama.cpp), vision models (VLM), image generation (Stable |
★ 397 | Kotlin |
| 32 | chatgpt-subtitle-translator Efficient translation tool based on ChatGPT or any OpenAI compatible LLM chat completion API |
★ 381 | JavaScript |
| 33 | toolkit.dev Get paid to build LLM tools |
★ 356 | TypeScript |
| 34 | claude-code-trace Claude Code session log viewer for JSONL files in ~/.claude/projects. Browse conversations, tool cal |
★ 328 | Rust |
| 35 | OpenRCA [ICLR'25] OpenRCA: Can Large Language Models Locate the Root Cause of Software Failures? |
★ 318 | Python |
| 36 | llms-tools A list of LLMs Tools & Projects |
★ 316 | |
| 37 | leanctx Drop-in prompt compression for production LLM apps. Cut your token bill 40-60% without changing your |
★ 312 | Python |
| 38 | openai-function-calling-tools 🛠 openai function calling tools for JS/TS |
★ 307 | TypeScript |
| 39 | bocoel Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) tha |
★ 289 | Python |
| 40 | SecGPT A Test Project for a Network Security-oriented LLM Tool Emulating AutoGPT |
★ 287 | Python |
| 41 | input0 Input0 — A macOS voice input tool: hold a hotkey to record, release to transcribe locally via STT, r |
★ 260 | Rust |
| 42 | gpt_server gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。 |
★ 253 | Python |
| 43 | codinit-dev Local-First Open Source web & mobile AI app builder — install on MacOS, Windows & Linux |
★ 235 | TypeScript |
| 44 | cai User friendly CLI tool for AI tasks. Stop thinking about LLMs and prompts, start getting results! |
★ 202 | Rust |
| 45 | llm-rl-environments-lil-course 🌱 A little course on Reinforcement Learning Environments for evaluating and training Language Model |
★ 201 | Python |
| 46 | Eridanus 基于 OneBot 协议的多功能bot兼开发框架。以llm function calling为核心构建了更智能的功能调用机制。支持不接入QQ纯Live2d桌宠模式 |
★ 190 | Python |
| 47 | aitools_client Seth's AI Tools: A Unity based front end that uses ComfyUI and LLMs to create stories, images, movie |
★ 185 | C# |
| 48 | marginalia A library-science-inspired personal knowledge management system with LLM agents |
★ 185 | Python |
| 49 | LLM-Tools Open-source calculator for LLM system requirements. |
★ 175 | Python |
| 50 | toolbench ToolBench, an evaluation suite for LLM tool manipulation capabilities. |
★ 172 | Python |
| 51 | chatlab ⚡️🧪 Fast LLM Tool Calling Experimentation, big and smol |
★ 159 | Jupyter Notebook |
| 52 | llm_intents Exposes internet search tools for use by LLM-backed Assist in Home Assistant |
★ 157 | Python |
| 53 | Awesome-Multi-Token-Prediction A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related technique |
★ 143 | |
| 54 | OSA Tool that just makes your open source project better using LLM agents |
★ 141 | Python |
| 55 | MLLM-Tool MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning |
★ 138 | Python |
| 56 | tokuin CLI tool – estimates LLM tokens/costs and runs provider-aware load tests for OpenAI, Anthropic, Open |
★ 138 | Rust |
| 57 | lm-proxy OpenAI-compatible HTTP LLM proxy / gateway for multi-provider inference (Google, Anthropic, OpenAI, |
★ 135 | Python |
| 58 | multi-model-chat Multi-Model Chat — Compare responses from multiple AI models side by side in real-time. Supports GPT |
★ 131 | TypeScript |
| 59 | open-creator An open-source LLM tool for extracting repeatable tasks from your conversations, and saving them int |
★ 129 | Python |
| 60 | banks LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and cha |
★ 127 | Python |
| 61 | bpmn-assistant LLM-powered assistant for creating, editing, and interpreting business process diagrams |
★ 124 | Python |
| 62 | cursive ✦ The intuitive LLM framework |
★ 114 | TypeScript |
| 63 | OpenWebui-Tools Custom tools to enhance your Open-Webui Experience! 🚀 |
★ 114 | Python |
| 64 | aulite EU AI Act compliance proxy for AI systems. Drop-in HTTP proxy that monitors every AI interaction for |
★ 112 | TypeScript |
| 65 | TokenTamer A drop-in proxy that compresses bloated code context in real-time, cutting LLM API costs by 50–80% w |
★ 112 | Python |
| 66 | dexter LLM tools used in production at Dexa |
★ 107 | TypeScript |
| 67 | LLM_trader LLM-powered Crypto Trading Framework with Vision AI chart analysis, real-time Neural Engine, and a l |
★ 98 | Python |
| 68 | pith Pith is the hook that makes Claude Code sessions last 3x longer. |
★ 96 | Python |
| 69 | llm-manpage-tool Inject relevant documentation into your prompts: 98% savings. |
★ 96 | Python |
| 70 | Eval High-performance LLM evaluation framework with parallel API calls — up to 17× faster than sequential |
★ 94 | Python |
| 71 | llm-tools-nmap |
★ 93 | Python |
| 72 | comfyui-llm-toolkit |
★ 86 | Python |
| 73 | llm-tool-collection Curated collection of tools for agentic LLMs in Emacs |
★ 83 | Emacs Lisp |
| 74 | outputguard Validate, repair, and retry LLM structured outputs. 13 repair strategies for common JSON malformatio |
★ 80 | Python |
| 75 | llm-tools-kiwix Turn any Kiwix ZIM archive (offline Wikipedia, Stack Exchange, DevDocs, etc.) into an instant knowle |
★ 77 | Python |
| 76 | awesome-pydantic-ai An opinionated list of awesome Pydantic-AI frameworks, libraries, software and resources. |
★ 73 | |
| 77 | AgentGuard AgentGuard:An Attribute-Based Access Control Framework for Tool-Use LLM-Based Agent |
★ 71 | Python |
| 78 | tessera From teacher to tiles — a from-scratch LLM distillation & serving engine: custom Triton/CUDA kernels |
★ 69 | Python |
| 79 | llm_context_benchmarks 📊 LLM Context Benchmarks - A comprehensive benchmarking tool for testing LLMs with varying context |
★ 68 | Python |
| 80 | pls CLI tool that turns natural language into shell commands via LLM |
★ 66 | Zig |
| 81 | reverse-engineering-is-over LLMs have ended reverse engineering as a high-barrier skill. Case study: reconstructing 6/7 custom H |
★ 66 | Python |
| 82 | LLM-Research A collection of LLM related papers, thesis, tools, datasets, courses, benchmarks |
★ 64 | Python |
| 83 | garycli The Spear Carrier. AI-native CLI agent for STM32 embedded development and automated debugging. (Gary |
★ 56 | Python |
| 84 | ue-llm-toolkit A set of tools allowing LLMs to see inside Unreal Engine 5 projects. |
★ 50 | C++ |
| 85 | control-layer A production-grade control layer that sits between your application logic and any LLM — input valida |
★ 50 | Python |
| 86 | lx CLI tool for bundling project files as LLM context. |
★ 49 | Go |
| 87 | WakenLLM-toolkit toolkit for WakenLLM framework |
★ 47 | Python |
| 88 | AddOn Home Assistant Community Cloud with secure, private, and free remote access, webrtc streaming, ChatG |
★ 45 | Python |
| 89 | better-sidebar-for-google-gemini-and-ai-studio Enhance your Google AI Studio experience with a better sidebar, folders, tags, prompt manager, and f |
★ 45 | TypeScript |
| 90 | Supervertaler-Workbench Open-source, AI-enhanced CAT tool with multi-LLM support, translation memory, glossary management, ' |
★ 43 | Python |
| 91 | excessibility Accessibility snapshot testing for Phoenix LiveView - capture HTML during tests, run Pa11y for WCAG |
★ 40 | Elixir |
| 92 | optipfair Structured pruning and bias visualization for Large Language Models. Tools for LLM optimization and |
★ 40 | Python |
| 93 | awesome-elixir-llm-genai A list of LLM and GenAI Elixir Resources/Tools |
★ 40 | |
| 94 | LLM-PromptEngineering-Agents ChatGPT, related application + prompt engineering list |
★ 36 | |
| 95 | simple-llm-cli LLM infradebugging and diagnostic tool |
★ 36 | Python |
| 96 | glimpser a simple tool for real-time monitoring video and summarization with LLMs |
★ 34 | Python |
| 97 | TensorFold Run MoE LLMs on Apple Silicone via MLX that your Mac should not normally be able to run |
★ 34 | Python |
| 98 | ica-lens-paper ICA Lens: compact ICA-based interpretability tools for exploring LLM activations. Code release for t |
★ 33 | Python |
| 99 | pyrlm-runtime Minimal runtime for Recursive Language Models (RLMs) inspired by the MIT CSAIL paper "Recursive Lang |
★ 32 | Python |
| 100 | video-helper 📺 AI视频学习助手: 自动生成 B站/YouTube/抖音/本地视频思维导图、笔记与总结。支持播客分析与视频索引,开源平替。AI Video Learning Assistant: Auto-ge |
★ 32 | Python |