videogui — Agent Tool by showlab

by showlab · Agent Tool · ★ 51

Last updated: · Indexed by AgentSkillsHub · Auto-synced every 8h

About videogui

VideoGUI: A Benchmark for GUI Automation from Instructional Videos) Kevin Qinghong Lin, Linjie Li, Difei Gao, Qinchen Wu, Mingyi Yan, Zhengyuan Yang, Lijuan Wang, Mike Zheng Shou 📢 News [2025.6] We release all metadata and human recording at Google-Drive. [2024.6] We release the arXiv paper. [2024.9] Accepted by NeurIPS 2024 D&B. [2024.10] We released the data at Huggingface dataset. Please stay tuned for further updates. 📝 TODO [ ] Upload the Evaluation code and metric implementation. [x] Upload the Missed metadata. 📖 Introduction TL;DR: A Multi-modal Benchmark for Visual-centric GUI Automation from Instructional Videos. Visual-centric softwares and tasks: VideoGUI focuses on professional and novel software like PR and AE for video editing, or Stable Diffusion and Runway for visual creation. Besides, the task query emphasizes

guillm-agentvideo-language

Quick Facts

Stars51
Forks3
LanguageJavaScript
CategoryAgent Tool
Quality Score58.290994500065/100
Open Issues2
Last Updated2026-02-22
Created2024-06-16
Platformsnode
Est. Tokens~2129k

videogui alternative? Top 6 similar tools

Looking for a videogui alternative? If you're comparing videogui with other agent tool tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.

  • taskyon by Xyntopia · ⭐ 52

    Browser based Interface for Generative AI. Chat/Agent/Taskmanager Hybrid.

  • LLM-Brained-GUI-Agents-Survey by vyokky · ⭐ 220

    GitHub page for "Large Language Model-Brained GUI Agents: A Survey"

  • EdgeBox by BIGPPWONG · ⭐ 198

    A fully-featured, GUI-powered local LLM Agent sandbox with complete MCP protocol support. Features both CLI

  • os-ai-computer-use by 777genius · ⭐ 165

    AI controls your OS. OS AI Computer Use, OS and API agnostic. For now on OpenAI and Anthropic API. Desktop app

  • Claude-Code-Board by cablate · ⭐ 144

    Use Claude Code on Kanban WebUI

  • Awesome-AI-For-Security by AmanPriyanshu · ⭐ 131

    A curated list of tools, papers, and datasets for applying AI to cybersecurity tasks. This list primarily focu

More Agent Tool Tools

Explore other popular agent tool tools:

View all Agent Tool tools →

Popular JavaScript Agent Tools

Frequently Asked Questions

What is videogui?

videogui is [NeurIPS 2024 D&B] VideoGUI: A Benchmark for GUI Automation from Instructional Videos. It is categorized as a Agent Tool with 51 GitHub stars.

What programming language is videogui written in?

videogui is primarily written in JavaScript. It covers topics such as gui, llm-agent, video-language.

How do I install or use videogui?

You can find installation instructions and usage details in the videogui GitHub repository at github.com/showlab/videogui. The project has 51 stars and 3 forks, indicating an active community.

What are the best alternatives to videogui?

The top alternatives to videogui on Agent Skills Hub include taskyon, LLM-Brained-GUI-Agents-Survey, EdgeBox. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.

View on GitHub → Browse Agent Tool tools