AgentBench — Agent Tool by THUDM

Last updated: 2026-02-08 · Indexed by AgentSkillsHub · Auto-synced every 8h

About AgentBench

AgentBench 🌐 Leaderboard (new) 📃 Paper 👋 Join our Slack for Q & A or collaboration on next version of AgentBench! 🔥[2025.10.10] Introducing AgentBench FC (Function Calling) based on AgentRL The current repository contains the function-calling version of AgentBench, integrated with AgentRL, an end-to-end multitask and mutliturn LLM Agent RL framework. If you wish to use the older version, you can revert to v0.1 and v0.2. Comparing to the original AgentBench, this version uses a function-calling style prompt, and adds fully-containerized deployment support for the following tasks: (AF) (DB) (KG) (OS) (WS) Quick Start We support a

chatgpt gpt-4 llm llm-agent

Quick Facts

Stars	3,214
Forks	240
Language	Python
Category	Agent Tool
License	Apache-2.0
Quality Score	67.438242037414/100
Open Issues	68
Last Updated	2026-02-08
Created	2023-07-28
Platforms	python
Est. Tokens	~1970k

AgentBench alternative? Top 6 similar tools

Looking for a AgentBench alternative? If you're comparing AgentBench with other agent tool tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.

langroid by langroid · ⭐ 4.0k
Harness LLMs with Multi-Agent Programming
awesome-generative-ai by filipecalegario · ⭐ 3.4k
A curated list of Generative AI tools, works, models, and references
ChatGPT-Shortcut by rockbenben · ⭐ 8.6k
🚀💪Maximize your efficiency and productivity. The ultimate hub to manage, customize, and share prompts. (Engl
gptme by gptme · ⭐ 4.3k
Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make
DecryptPrompt by DSXiangLi · ⭐ 3.4k
总结Prompt&LLM论文，开源数据&模型，AIGC应用
aide by nicepkg · ⭐ 2.7k
Conquer Any Code in VSCode: One-Click Comments, Conversions, UI-to-Code, and AI Batch Processing of Files! 在 V

More Agent Tool Tools

Explore other popular agent tool tools:

View all Agent Tool tools →

Popular Python Agent Tools

TrendRadar ⭐ 60.2k · MCP Server
gpt-researcher ⭐ 27.9k · MCP Server
Scrapling ⭐ 67.2k · MCP Server
serena ⭐ 26.0k · MCP Server
MaxKB ⭐ 21.6k · MCP Server

Frequently Asked Questions

What is AgentBench?

AgentBench is A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24). It is categorized as a Agent Tool with 3.2k GitHub stars.

What programming language is AgentBench written in?

AgentBench is primarily written in Python. It covers topics such as chatgpt, gpt-4, llm.

How do I install or use AgentBench?

You can find installation instructions and usage details in the AgentBench GitHub repository at github.com/THUDM/AgentBench. The project has 3.2k stars and 240 forks, indicating an active community.

What license does AgentBench use?

AgentBench is released under the Apache-2.0 license, making it free to use and modify according to the license terms.

What are the best alternatives to AgentBench?

The top alternatives to AgentBench on Agent Skills Hub include langroid, awesome-generative-ai, ChatGPT-Shortcut. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.

View on GitHub → Browse Agent Tool tools