toolbench — LLM Plugin by sambanova

by sambanova · LLM Plugin · ★ 172

Indexed by AgentSkillsHub · Auto-synced every 8h

About toolbench

Recent studies on software tool manipulation with large language models (LLMs) mostly rely on closed model APIs (e.g. OpenAI), as there is a significant accuracy gap between those closed models and open-source LLMs. To study the root cause of the gap and further facilitate the development of open-source LLMs, especially their tool manipulation capabilities, we created ToolBench. ToolBench is a benchmark consisting of diverse software tools for real-world tasks. We also provide easy-to-use infrastructure in this repository to directly evaluate the execution success rate of each model. Contributions to this repo are highly welcome! We are excited to see new action generation algorithms and new testing tasks.
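As a rough illustration of what an "execution success rate" could mean, the sketch below counts a test case as a success when the action generated by a model executes without error and returns the expected result. This is an assumption about the metric, not the repository's actual code: the names `execution_success_rate`, `generate_action`, and `execute` are hypothetical and do not come from ToolBench.

```python
from typing import Callable, Iterable, Tuple


def execution_success_rate(
    cases: Iterable[Tuple[str, str]],
    generate_action: Callable[[str], str],
    execute: Callable[[str], str],
) -> float:
    """Return the fraction of (goal, expected) cases that succeed.

    Hypothetical helper: `generate_action` stands in for a model that turns
    a goal into an action string, and `execute` stands in for the tool that
    runs the action and returns its result.
    """
    total = 0
    passed = 0
    for goal, expected in cases:
        total += 1
        try:
            result = execute(generate_action(goal))
        except Exception:
            # An action that fails to execute counts as a failure.
            continue
        if result == expected:
            passed += 1
    # Avoid division by zero when no cases are given.
    return passed / total if total else 0.0
```

Passing the model and the tool in as plain callables keeps the sketch independent of any particular LLM client or tool API.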

Quick Facts

Stars: 172
Forks: 11
Language: Python
Category: LLM Plugin
License: Apache-2.0
Quality Score: 69.279476379072/100
Open Issues: 1
Last Updated: 2024-02-28
Created: 2023-05-19
Platforms: python
Est. Tokens: ~50k

toolbench alternative? Top 6 similar tools

Looking for a toolbench alternative? If you're comparing toolbench with other LLM Plugin tools, these 6 projects are the closest alternatives on Agent Skills Hub, ranked by topic overlap, star count, and community traction.

  • echo by Merit-Systems · ⭐ 539

    The User Pays AI SDK

  • codinit-dev by codinit-dev · ⭐ 235

    Local-First Open Source web & mobile AI app builder — install on MacOS, Windows & Linux

  • lm-proxy by Nayjest · ⭐ 135

    OpenAI-compatible HTTP LLM proxy / gateway for multi-provider inference (Google, Anthropic, OpenAI, PyTorch).

  • multi-model-chat by seehiong · ⭐ 131

    Multi-Model Chat — Compare responses from multiple AI models side by side in real-time. Supports GPT, Claude,

  • bpmn-assistant by jtlicardo · ⭐ 124

    LLM-powered assistant for creating, editing, and interpreting business process diagrams

  • cursive by meistrari · ⭐ 114

    ✦ The intuitive LLM framework

Frequently Asked Questions

What is toolbench?

toolbench is ToolBench, an evaluation suite for LLM tool manipulation capabilities. It is categorized as an LLM Plugin and has 172 GitHub stars.

What programming language is toolbench written in?

toolbench is primarily written in Python.

How do I install or use toolbench?

You can find installation instructions and usage details in the toolbench GitHub repository at github.com/sambanova/toolbench. The project has 172 stars and 11 forks.

What license does toolbench use?

toolbench is released under the Apache-2.0 license, making it free to use and modify according to the license terms.

What are the best alternatives to toolbench?

The top alternatives to toolbench on Agent Skills Hub include echo, codinit-dev, and lm-proxy. Each offers a different approach to the same problem space; compare them side by side by stars, quality score, and community activity.
