by THUDM · Agent Tool · ★ 3.2k
Last updated: · Indexed by AgentSkillsHub · Auto-synced every 8h
AgentBench 🌐 Leaderboard (new) 📃 Paper 👋 Join our Slack for Q & A or collaboration on next version of AgentBench! 🔥[2025.10.10] Introducing AgentBench FC (Function Calling) based on AgentRL The current repository contains the function-calling version of AgentBench, integrated with AgentRL, an end-to-end multitask and mutliturn LLM Agent RL framework. If you wish to use the older version, you can revert to v0.1 and v0.2. Comparing to the original AgentBench, this version uses a function-calling style prompt, and adds fully-containerized deployment support for the following tasks: (AF) (DB) (KG) (OS) (WS) Quick Start We support a
| Stars | 3,214 |
| Forks | 240 |
| Language | Python |
| Category | Agent Tool |
| License | Apache-2.0 |
| Quality Score | 67.438242037414/100 |
| Open Issues | 68 |
| Last Updated | 2026-02-08 |
| Created | 2023-07-28 |
| Platforms | python |
| Est. Tokens | ~1970k |
Looking for a AgentBench alternative? If you're comparing AgentBench with other agent tool tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.
Harness LLMs with Multi-Agent Programming
A curated list of Generative AI tools, works, models, and references
🚀💪Maximize your efficiency and productivity. The ultimate hub to manage, customize, and share prompts. (Engl
Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make
总结Prompt&LLM论文,开源数据&模型,AIGC应用
Conquer Any Code in VSCode: One-Click Comments, Conversions, UI-to-Code, and AI Batch Processing of Files! 在 V
Explore other popular agent tool tools:
AgentBench is A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24). It is categorized as a Agent Tool with 3.2k GitHub stars.
AgentBench is primarily written in Python. It covers topics such as chatgpt, gpt-4, llm.
You can find installation instructions and usage details in the AgentBench GitHub repository at github.com/THUDM/AgentBench. The project has 3.2k stars and 240 forks, indicating an active community.
AgentBench is released under the Apache-2.0 license, making it free to use and modify according to the license terms.
The top alternatives to AgentBench on Agent Skills Hub include langroid, awesome-generative-ai, ChatGPT-Shortcut. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.