GTA — Agent Tool by open-compass

by open-compass · Agent Tool · ★ 142

Last updated: · Indexed by AgentSkillsHub · Auto-synced every 8h

About GTA

GTA: General Tool Agent Benchmark and Evaluation Framework [[NeurIPS 2024 D&B] GTA: A Benchmark for General Tool Agents](https://proceedings.neurips.cc/paperfiles/paper/2024/file/8a75ee6d4b2eb0b777f549a32a5a5c28-Paper-DatasetsandBenchmarksTrack.pdf) [[arXiv 2026] GTA-2: Benchmarking General Tool Agents from Atomic Tool-Use to Open-Ended Workflows](https://arxiv.org/pdf/2604.15715) ⬇️ Download Dataset Here: [GTA-Atomic] [GTA-Workflow] 🌟 Introduction GTA-2 is a benchmark and evaluation kit for General Tool Agents, designed to bridge atomic tool-use evaluation and open-ended workflow evaluation in one repository. Benchmark hierarchy GTA-Workflow: the new focus of GTA-2, for long-horizon, open-ended workflow evaluation. GTA-Atomic: the original GTA benchmark for short-horizon atomic tool-use tasks. Please refer to READMEGTA-1.md. This readme is centered around GTA-Workflow, which targets realistic long-horizon tasks with open-ended deliverables. Compared with traditional benchmark-style evaluation, GTA-Workflow focuses more on what an agent can finally accomplish in a complete workflow, rather than only whether it predicts the next tool

llm-agentllm-evaluation

Quick Facts

Stars142
Forks9
LanguagePython
CategoryAgent Tool
LicenseApache-2.0
Quality Score64.602976318854/100
Last Updated2026-04-20
Created2024-06-06
Platformspython
Est. Tokens~789k

GTA alternative? Top 6 similar tools

Looking for a GTA alternative? If you're comparing GTA with other agent tool tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.

  • Awesome-LLM-in-Social-Science by ValueByte-AI · ⭐ 627

    Awesome papers involving LLMs in Social Science.

  • palico-ai by palico-ai · ⭐ 342

    Build, Improve Performance, and Productionize your LLM Application with an Integrated Framework

  • WebCanvas by iMeanAI · ⭐ 276

    All-in-one Web Agent framework for post-training. Start building with a few clicks!

  • CivAgent by fuxiAIlab · ⭐ 147

    CivAgent is an LLM-based Human-like Agent acting as a Digital Player within the Strategy Game Unciv.

  • Awesome-LLMs-ICLR-24 by azminewasi · ⭐ 66

    It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learni

  • wcgw by rusiaaman · ⭐ 655

    Shell and coding agent on mcp clients

More Agent Tool Tools

Explore other popular agent tool tools:

View all Agent Tool tools →

Popular Python Agent Tools

Frequently Asked Questions

What is GTA?

GTA is [NeurIPS 2024 D&B] GTA: A Benchmark for General Tool Agents & [arXiv 2026] GTA-2. It is categorized as a Agent Tool with 142 GitHub stars.

What programming language is GTA written in?

GTA is primarily written in Python. It covers topics such as llm-agent, llm-evaluation.

How do I install or use GTA?

You can find installation instructions and usage details in the GTA GitHub repository at github.com/open-compass/GTA. The project has 142 stars and 9 forks, indicating an active community.

What license does GTA use?

GTA is released under the Apache-2.0 license, making it free to use and modify according to the license terms.

What are the best alternatives to GTA?

The top alternatives to GTA on Agent Skills Hub include Awesome-LLM-in-Social-Science, palico-ai, WebCanvas. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.

View on GitHub → Browse Agent Tool tools