by agents-x-project · Agent Tool · ★ 153
Last updated: · Indexed by AgentSkillsHub · Auto-synced every 8h
PyVision: Agentic Vision with Dynamic Tooling 🎯Overview LLMs are increasingly deployed as agents, systems capable of planning, reasoning, and dynamically calling external tools. However, in visual reasoning, prior approaches largely remain limited by predefined workflows and static toolsets. In this report, we present , an interactive, multi-turn framework that enables MLLMs to autonomously generate, execute, and refine Python-based tools tailored to the task at hand, unlocking flexible and interpretable problem-solving. We develop a taxonomy of the tools created by PyVision and analyze their usage across a diverse set of benchmarks. Quantitatively, PyVision achieves consistent performance gains, boosting GPT-4.1 by +7.8\% on V and Claude-4.0-Sonnet by +31.1\% on VLMsAreBlind-mini. These results point to a broader shift: dynamic tooling allows models not just to use tools, but to invent them
| Stars | 153 |
| Forks | 7 |
| Language | Python |
| Category | Agent Tool |
| Quality Score | 37.25/100 |
| Last Updated | 2025-07-22 |
| Created | 2025-06-27 |
| Platforms | python |
| Est. Tokens | ~604k |
These tools work well together with PyVision for enhanced workflows:
Looking for a PyVision alternative? If you're comparing PyVision with other agent tool tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.
AI controls your OS. OS AI Computer Use, OS and API agnostic. For now on OpenAI and Anthropic API. Desktop app
Eva01 is NOT an assistant. She is an AI being with her own mind, feelings, and intrinsic drives. Multimodal, M
Interactive LLM Powered NPCs, is an open-source project that completely transforms your interaction with non-p
The open context engine for AI agents support 15+ data sources. Built on Rust and Apache DataFusion.
Shell and coding agent on mcp clients
AgentChat 是一个基于 LLM 的智能体交流平台,内置默认 Agent 并支持用户自定义 Agent。通过多轮对话和任务协作,Agent 可以理解并协助完成复杂任务。项目集成 LangChain、Function
Explore other popular agent tool tools:
PyVision is [MTI-LLM@NeurIPS 2025] Official implementation of "PyVision: Agentic Vision with Dynamic Tooling.". It is categorized as a Agent Tool with 153 GitHub stars.
PyVision is primarily written in Python. It covers topics such as agent, computer-vision, mllm.
You can find installation instructions and usage details in the PyVision GitHub repository at github.com/agents-x-project/PyVision. The project has 153 stars and 7 forks, indicating an active community.
The top alternatives to PyVision on Agent Skills Hub include os-ai-computer-use, Eva01, Interactive-LLM-Powered-NPCs. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.