PyVision — Agent Tool by agents-x-project

by agents-x-project · Agent Tool · ★ 153

Last updated: · Indexed by AgentSkillsHub · Auto-synced every 8h

About PyVision

PyVision: Agentic Vision with Dynamic Tooling 🎯Overview LLMs are increasingly deployed as agents, systems capable of planning, reasoning, and dynamically calling external tools. However, in visual reasoning, prior approaches largely remain limited by predefined workflows and static toolsets. In this report, we present , an interactive, multi-turn framework that enables MLLMs to autonomously generate, execute, and refine Python-based tools tailored to the task at hand, unlocking flexible and interpretable problem-solving. We develop a taxonomy of the tools created by PyVision and analyze their usage across a diverse set of benchmarks. Quantitatively, PyVision achieves consistent performance gains, boosting GPT-4.1 by +7.8\% on V and Claude-4.0-Sonnet by +31.1\% on VLMsAreBlind-mini. These results point to a broader shift: dynamic tooling allows models not just to use tools, but to invent them

agentcomputer-visionmllm

Quick Facts

Stars153
Forks7
LanguagePython
CategoryAgent Tool
Quality Score37.25/100
Last Updated2025-07-22
Created2025-06-27
Platformspython
Est. Tokens~604k

Compatible Skills

These tools work well together with PyVision for enhanced workflows:

  • vllm-mlx — semantic(0.22)+complementary+rare_topics+same_lang+similar_pop+shared_platform (62%)
  • os-ai-computer-use — semantic(0.24)+complementary+rare_topics+same_lang+similar_pop+shared_platform (58%)
  • imagesorcery-mcp — semantic(0.16)+complementary+rare_topics+same_lang+similar_pop+shared_platform (55%)
  • X-Master — semantic(0.22)+complementary+same_lang+similar_pop+shared_platform (53%)

PyVision alternative? Top 6 similar tools

Looking for a PyVision alternative? If you're comparing PyVision with other agent tool tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.

  • os-ai-computer-use by 777genius · ⭐ 165

    AI controls your OS. OS AI Computer Use, OS and API agnostic. For now on OpenAI and Anthropic API. Desktop app

  • Eva01 by Genesis1231 · ⭐ 143

    Eva01 is NOT an assistant. She is an AI being with her own mind, feelings, and intrinsic drives. Multimodal, M

  • Interactive-LLM-Powered-NPCs by AkshitIreddy · ⭐ 694

    Interactive LLM Powered NPCs, is an open-source project that completely transforms your interaction with non-p

  • wren-engine by Canner · ⭐ 662

    The open context engine for AI agents support 15+ data sources. Built on Rust and Apache DataFusion.

  • wcgw by rusiaaman · ⭐ 655

    Shell and coding agent on mcp clients

  • AgentChat by Shy2593666979 · ⭐ 649

    AgentChat 是一个基于 LLM 的智能体交流平台,内置默认 Agent 并支持用户自定义 Agent。通过多轮对话和任务协作,Agent 可以理解并协助完成复杂任务。项目集成 LangChain、Function

More Agent Tool Tools

Explore other popular agent tool tools:

View all Agent Tool tools →

Popular Python Agent Tools

Frequently Asked Questions

What is PyVision?

PyVision is [MTI-LLM@NeurIPS 2025] Official implementation of "PyVision: Agentic Vision with Dynamic Tooling.". It is categorized as a Agent Tool with 153 GitHub stars.

What programming language is PyVision written in?

PyVision is primarily written in Python. It covers topics such as agent, computer-vision, mllm.

How do I install or use PyVision?

You can find installation instructions and usage details in the PyVision GitHub repository at github.com/agents-x-project/PyVision. The project has 153 stars and 7 forks, indicating an active community.

What are the best alternatives to PyVision?

The top alternatives to PyVision on Agent Skills Hub include os-ai-computer-use, Eva01, Interactive-LLM-Powered-NPCs. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.

View on GitHub → Browse Agent Tool tools