by sambanova · LLM Plugin · ★ 172
Last updated: · Indexed by AgentSkillsHub · Auto-synced every 8h
ToolBench Recent studies on software tool manipulation with large language models (LLMs) mostly rely on closed model APIs (e.g. OpenAI), as there is an significant gap of model accuracy between those closed models and all the rest open-source LLMs. To study the root cause of the gap and further facilitate the development of open-source LLMs, especially their capabilities on tool manipulation, we create the ToolBench. The ToolBench is a benchmark consisting of diverse software tools for real-world tasks. We also provide easy-to-use infrastructure in this repository to directly evaluate the execution success rate of each model. Contributions to this repo are highly welcomed! We are excited to see new action generation algorithms and new testing tasks. #
| Stars | 172 |
| Forks | 11 |
| Language | Python |
| Category | LLM Plugin |
| License | Apache-2.0 |
| Quality Score | 69.279476379072/100 |
| Open Issues | 1 |
| Last Updated | 2024-02-28 |
| Created | 2023-05-19 |
| Platforms | python |
| Est. Tokens | ~50k |
Looking for a toolbench alternative? If you're comparing toolbench with other llm plugin tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.
The User Pays AI SDK
Local-First Open Source web & mobile AI app builder — install on MacOS, Windows & Linux
OpenAI-compatible HTTP LLM proxy / gateway for multi-provider inference (Google, Anthropic, OpenAI, PyTorch).
Multi-Model Chat — Compare responses from multiple AI models side by side in real-time. Supports GPT, Claude,
LLM-powered assistant for creating, editing, and interpreting business process diagrams
✦ The intuitive LLM framework
Explore other popular llm plugin tools:
toolbench is ToolBench, an evaluation suite for LLM tool manipulation capabilities.. It is categorized as a LLM Plugin with 172 GitHub stars.
toolbench is primarily written in Python.
You can find installation instructions and usage details in the toolbench GitHub repository at github.com/sambanova/toolbench. The project has 172 stars and 11 forks, indicating an active community.
toolbench is released under the Apache-2.0 license, making it free to use and modify according to the license terms.
The top alternatives to toolbench on Agent Skills Hub include echo, codinit-dev, lm-proxy. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.