by showlab · Agent Tool · ★ 51
VideoGUI: A Benchmark for GUI Automation from Instructional Videos
Kevin Qinghong Lin, Linjie Li, Difei Gao, Qinchen Wu, Mingyi Yan, Zhengyuan Yang, Lijuan Wang, Mike Zheng Shou

📢 News
[2025.6] We release all metadata and human recordings on Google Drive.
[2024.6] We release the arXiv paper.
[2024.9] Accepted by NeurIPS 2024 D&B.
[2024.10] We released the data as a Hugging Face dataset.
Please stay tuned for further updates.

📝 TODO
[ ] Upload the evaluation code and metric implementation.
[x] Upload the missing metadata.

📖 Introduction
TL;DR: A multi-modal benchmark for visual-centric GUI automation from instructional videos.
Visual-centric software and tasks: VideoGUI focuses on professional and novel software such as Premiere Pro (PR) and After Effects (AE) for video editing, or Stable Diffusion and Runway for visual creation. Besides, the task query emphasizes
| Stars | 51 |
| Forks | 3 |
| Language | JavaScript |
| Category | Agent Tool |
| Quality Score | 58.29/100 |
| Open Issues | 2 |
| Last Updated | 2026-02-22 |
| Created | 2024-06-16 |
| Platforms | node |
| Est. Tokens | ~2129k |
Looking for a videogui alternative? If you're comparing videogui with other agent tools, these 6 projects are the closest alternatives on Agent Skills Hub, ranked by topic overlap, star count, and community traction.
Browser based Interface for Generative AI. Chat/Agent/Taskmanager Hybrid.
GitHub page for "Large Language Model-Brained GUI Agents: A Survey"
A fully-featured, GUI-powered local LLM Agent sandbox with complete MCP protocol support. Features both CLI
AI controls your OS. OS AI Computer Use, OS and API agnostic. For now on OpenAI and Anthropic API. Desktop app
Use Claude Code on Kanban WebUI
A curated list of tools, papers, and datasets for applying AI to cybersecurity tasks. This list primarily focu
videogui is [NeurIPS 2024 D&B] VideoGUI: A Benchmark for GUI Automation from Instructional Videos. It is categorized as an Agent Tool with 51 GitHub stars.
videogui is primarily written in JavaScript. It covers topics such as gui, llm-agent, video-language.
You can find installation instructions and usage details in the videogui GitHub repository at github.com/showlab/videogui. The project has 51 stars and 3 forks.
The top alternatives to videogui on Agent Skills Hub include taskyon, LLM-Brained-GUI-Agents-Survey, and EdgeBox. Each takes a different approach to the same problem space; compare them side by side by stars, quality score, and community activity.