by pmbstyle · Agent Tool · ★ 64
Last updated: · Indexed by AgentSkillsHub · Auto-synced every 8h
Gemini Browser Agent A research experiment and browser automation project scaffolding. Run agent tasks right in your Chrome browser. Overview Gemini Browser Agent is an automation agent that bridges a Chrome extension with Google’s Gemini Computer Use API. It observes the active tab, exchanges screenshots and events with the model, and performs actions directly in your own browser, no sandbox or virtual machine required. Setup Install Python 3.10+ and Chrome (or Chromium-based) browser. Clone this repository and open a terminal in the project directory. (Optional) Create a virtual environment: Run the setup helper to install dependencies and scaffold : Visit to create a Gemini API key, then place it in the generated file as . Usage Start the Python WebSocket bridge: Open Chrome and navigate to . Enable Developer mode, choose Load unpacked, and select the directory from this project. Open the sidebar, click
| Stars | 64 |
| Forks | 15 |
| Language | JavaScript |
| Category | Agent Tool |
| Quality Score | 42.7/100 |
| Last Updated | 2025-10-23 |
| Created | 2025-10-23 |
| Platforms | browser, gemini, node |
| Est. Tokens | ~2k |
These tools work well together with gemini-browser-agent for enhanced workflows:
Looking for a gemini-browser-agent alternative? If you're comparing gemini-browser-agent with other agent tool tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.
Agent-driven Chrome extension for full browser control via CLI
The open-source execution engine for AI agents. 412 modules, MCP-native, triggers, queue, versioning, metering
🔮 Platform-neutral Browser Use for AI agents: real Chrome automation with a CLI + SDKs, no lock-in, dead simp
◉ Universal Intelligence: AI made simple.
Index the world's undocumented APIs
AI controls your OS. OS AI Computer Use, OS and API agnostic. For now on OpenAI and Anthropic API. Desktop app
Explore other popular agent tool tools:
gemini-browser-agent is A browser agent with a Google Chrome extension that can work in your browser. Based on Google Gemini 2.5 computer use model.. It is categorized as a Agent Tool with 64 GitHub stars.
gemini-browser-agent is primarily written in JavaScript. It covers topics such as ai-agents, ai-browser-automation, browser-automation.
You can find installation instructions and usage details in the gemini-browser-agent GitHub repository at github.com/pmbstyle/gemini-browser-agent. The project has 64 stars and 15 forks, indicating an active community.
The top alternatives to gemini-browser-agent on Agent Skills Hub include Interceptor, flyto-core, open-browser-use. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.