gemini-browser-agent — Agent Tool by pmbstyle

by pmbstyle · Agent Tool · ★ 64

Last updated: · Indexed by AgentSkillsHub · Auto-synced every 8h

About gemini-browser-agent

Gemini Browser Agent A research experiment and browser automation project scaffolding. Run agent tasks right in your Chrome browser. Overview Gemini Browser Agent is an automation agent that bridges a Chrome extension with Google’s Gemini Computer Use API. It observes the active tab, exchanges screenshots and events with the model, and performs actions directly in your own browser, no sandbox or virtual machine required. Setup Install Python 3.10+ and Chrome (or Chromium-based) browser. Clone this repository and open a terminal in the project directory. (Optional) Create a virtual environment: Run the setup helper to install dependencies and scaffold : Visit to create a Gemini API key, then place it in the generated file as . Usage Start the Python WebSocket bridge: Open Chrome and navigate to . Enable Developer mode, choose Load unpacked, and select the directory from this project. Open the sidebar, click

ai-agentsai-browser-automationbrowser-automationbrowser-extensionbrowser-usegemini-apijavascriptpython

Quick Facts

Stars64
Forks15
LanguageJavaScript
CategoryAgent Tool
Quality Score42.7/100
Last Updated2025-10-23
Created2025-10-23
Platformsbrowser, gemini, node
Est. Tokens~2k

Compatible Skills

These tools work well together with gemini-browser-agent for enhanced workflows:

  • vibe-annotations — semantic(0.24)+complementary+rare_topics+same_lang+similar_pop+shared_platform (58%)
  • claude-chromium-native-messaging — semantic(0.47)+complementary+rare_topics+similar_pop+shared_platform (56%)
  • claude-mcp — semantic(0.31)+complementary+same_lang+similar_pop+shared_platform (56%)

gemini-browser-agent alternative? Top 6 similar tools

Looking for a gemini-browser-agent alternative? If you're comparing gemini-browser-agent with other agent tool tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.

  • Interceptor by Hacker-Valley-Media · ⭐ 275

    Agent-driven Chrome extension for full browser control via CLI

  • flyto-core by flytohub · ⭐ 275

    The open-source execution engine for AI agents. 412 modules, MCP-native, triggers, queue, versioning, metering

  • open-browser-use by iFurySt · ⭐ 104

    🔮 Platform-neutral Browser Use for AI agents: real Chrome automation with a CLI + SDKs, no lock-in, dead simp

  • universal-intelligence by blueraai · ⭐ 55

    ◉ Universal Intelligence: AI made simple.

  • bluebox by VectorlyApp · ⭐ 191

    Index the world's undocumented APIs

  • os-ai-computer-use by 777genius · ⭐ 165

    AI controls your OS. OS AI Computer Use, OS and API agnostic. For now on OpenAI and Anthropic API. Desktop app

More Agent Tool Tools

Explore other popular agent tool tools:

View all Agent Tool tools →

Popular JavaScript Agent Tools

Frequently Asked Questions

What is gemini-browser-agent?

gemini-browser-agent is A browser agent with a Google Chrome extension that can work in your browser. Based on Google Gemini 2.5 computer use model.. It is categorized as a Agent Tool with 64 GitHub stars.

What programming language is gemini-browser-agent written in?

gemini-browser-agent is primarily written in JavaScript. It covers topics such as ai-agents, ai-browser-automation, browser-automation.

How do I install or use gemini-browser-agent?

You can find installation instructions and usage details in the gemini-browser-agent GitHub repository at github.com/pmbstyle/gemini-browser-agent. The project has 64 stars and 15 forks, indicating an active community.

What are the best alternatives to gemini-browser-agent?

The top alternatives to gemini-browser-agent on Agent Skills Hub include Interceptor, flyto-core, open-browser-use. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.

View on GitHub → Browse Agent Tool tools