OmniParser — Agent Tool by microsoft

by microsoft · Agent Tool · ★ 24.6k

Indexed by AgentSkillsHub · Auto-synced every 8h

About OmniParser

OmniParser: Screen Parsing tool for Pure Vision Based GUI Agent

📢 [Project Page] [V2 Blog Post] [Models V2] [Models V1.5] [HuggingFace Space Demo]

OmniParser is a comprehensive method for parsing user interface screenshots into structured and easy-to-understand elements, which significantly enhances the ability of GPT-4V to generate actions that can be accurately grounded in the corresponding regions of the interface.

News

[2025/3] We support local logging of trajectory so that you can use OmniParser+OmniTool to build a training data pipeline for your favorite agent in your domain. [Documentation WIP]

[2025/3] We are gradually adding multi-agent orchestration and im

Quick Facts

Stars: 24,633
Forks: 2,157
Language: Jupyter Notebook
Category: Agent Tool
License: CC-BY-4.0
Quality Score: 65.649455723609/100
Open Issues: 232
Last Updated: 2026-04-13
Created: 2024-09-20
Est. Tokens: ~3976k

OmniParser alternatives: the closest similar tool

Looking for an OmniParser alternative? If you're comparing OmniParser with other agent tools, this project is the closest alternative on Agent Skills Hub, ranked by topic overlap, star count, and community traction.

  • open-saas by wasp-lang · ⭐ 14.7k

    A 100% free modern JS SaaS boilerplate (React, NodeJS, Prisma). Full-featured: Auth (email, google, github, sl

Frequently Asked Questions

What is OmniParser?

OmniParser is a simple screen parsing tool for pure-vision-based GUI agents. It is categorized as an Agent Tool and has 24.6k GitHub stars.

What programming language is OmniParser written in?

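The repository's primary language is listed as Jupyter Notebook, which reflects the notebook files in the repository rather than a programming language as such.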

How do I install or use OmniParser?

You can find installation instructions and usage details in the OmniParser GitHub repository at github.com/microsoft/OmniParser. The project has 24.6k stars and 2,157 forks, indicating an active community.

What license does OmniParser use?

OmniParser is released under the CC-BY-4.0 license, which permits use and modification provided that attribution is given, according to the license terms.

What are the best alternatives to OmniParser?

The closest alternative to OmniParser listed on Agent Skills Hub is open-saas. Compare the two side by side by stars, quality score, and community activity.
