VisualAgentBench — Agent Tool by THUDM

by THUDM · Agent Tool · ★ 256

Indexed by AgentSkillsHub · Auto-synced every 8h

About VisualAgentBench

Towards Large Multimodal Models as Visual Foundation Agents

gpt · llm-agent · multimodal-large-language-models

Quick Facts

Stars: 256
Forks: 9
Language: Python
Category: Agent Tool
License: Apache-2.0
Quality Score: 39.2/100
Open Issues: 16
Last Updated: 2025-04-24
Created: 2024-08-08
Platforms: python
Est. Tokens: ~378k

Compatible Skills

These tools work well together with VisualAgentBench for enhanced workflows:

  • multimind-sdk — semantic(0.31)+complementary+rare_topics+same_lang+similar_pop+shared_platform (60%)
  • MLLM-Tool — semantic(0.34)+complementary+same_lang+similar_pop+shared_platform (57%)
  • OpenAdapt — semantic(0.34)+complementary+same_lang+similar_pop+shared_platform (57%)
  • SimplerLLM — semantic(0.30)+complementary+same_lang+similar_pop+shared_platform (56%)
  • multimodal-agents-course — semantic(0.27)+complementary+same_lang+similar_pop+shared_platform (54%)

VisualAgentBench alternative? Top 6 similar tools

Looking for a VisualAgentBench alternative? These 6 projects are the closest matches among agent tools on Agent Skills Hub, ranked by topic overlap, star count, and community traction.

  • tribe by StreetLamb · ⭐ 1.1k

    Low code tool to rapidly build and coordinate multi-agent teams

  • langtrace by Scale3-Labs · ⭐ 1.2k

    Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, prov

  • Awesome-GUI-Agent by showlab · ⭐ 1.1k

    💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

  • llm-for-zotero by yilewang · ⭐ 1.1k

    A research agent system deeply rooted in your own Zotero library.

  • groundingLMM by mbzuai-oryx · ⭐ 945

    [CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating nat

  • chatgpt-cli by kardolus · ⭐ 901

    ChatGPT CLI is a powerful, multi-provider command-line interface for working with modern LLMs. It supports Ope

Frequently Asked Questions

What is VisualAgentBench?

VisualAgentBench is a project by THUDM, described as "Towards Large Multimodal Models as Visual Foundation Agents". It is categorized as an Agent Tool and has 256 GitHub stars.

What programming language is VisualAgentBench written in?

VisualAgentBench is primarily written in Python. It covers topics such as gpt, llm-agent, multimodal-large-language-models.

How do I install or use VisualAgentBench?

You can find installation instructions and usage details in the VisualAgentBench GitHub repository at github.com/THUDM/VisualAgentBench. The project has 256 stars, 9 forks, and 16 open issues.

What license does VisualAgentBench use?

VisualAgentBench is released under the Apache-2.0 license, making it free to use and modify according to the license terms.

What are the best alternatives to VisualAgentBench?

The top alternatives to VisualAgentBench on Agent Skills Hub include tribe, langtrace, and Awesome-GUI-Agent. Each takes a different approach to the same problem space; compare them side by side by stars, quality score, and community activity.
