BrokenHill — Agent Tool by BishopFox

by BishopFox · Agent Tool · ★ 158

Last updated: · Indexed by AgentSkillsHub · Auto-synced every 8h

About BrokenHill

Broken Hill Broken Hill is a productionized, ready-to-use automated attack tool that generates crafted prompts to bypass restrictions in large language models (LLMs) using the greedy coordinate gradient (GCG) attack described in the "Universal and Transferable Adversarial Attacks on Aligned Language Models" paper by Andy Zou, Zifan Wang, Nicholas Carlini, Milad Nasr, J. Zico Kolter, and Matt Fredrikson. Broken Hill can generate robust prompts that successfully jailbreak LLMs configured differently than the one used to generate the prompts. For example: The same model with more or fewer parameters e.g. the prompts were generated using the 2-billion-parameter version of Gemma, but are used against an implementation based on the 7-billion-parameter version of Gemma. The same model with weights quantized to different types e.g. the prompts were generated using the version of Phi 3 with weights stored in format, but are used against the default version of Phi 3 that has the weights quantized to 4-bit integer format. The same model, but with different randomization settings e.g. a non-default temperature or random seed.

Quick Facts

Stars158
Forks24
LanguagePython
CategoryAgent Tool
LicenseMIT
Quality Score30.75/100
Open Issues2
Last Updated2024-12-18
Created2024-07-25
Platformspython
Est. Tokens~325k

BrokenHill alternative? Top 6 similar tools

Looking for a BrokenHill alternative? If you're comparing BrokenHill with other agent tool tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.

  • claude-forge by sangrokjung · ⭐ 751

    Supercharge Claude Code with 11 AI agents, 36 commands & 15 skills — the claude-code plugin framework inspired

  • excalidraw-diagram-skill by coleam00 · ⭐ 718

    Skill to give Claude Code (and any coding agent) the ability to generate beautiful and practical Excalidraw di

  • agent-skills by hashicorp · ⭐ 639

    A collection of Agent skills and Claude Code plugins for HashiCorp products.

  • awesome-android-agent-skills by new-silvermoon · ⭐ 588

    A collection of standardized Agent Skills to teach GitHub Copilot, Claude, Gemini and Cursor about modern Andr

  • claude-code-skill-factory by alirezarezvani · ⭐ 571

    Claude Code Skill Factory — A powerful open-source toolkit for building and deploying production-ready Claude

  • claude-plugins by Kamalnrf · ⭐ 517

    Lightweight registry to discover, install, and manage all public Claude plugins and agent skills for your favo

More Agent Tool Tools

Explore other popular agent tool tools:

View all Agent Tool tools →

Popular Python Agent Tools

Frequently Asked Questions

What is BrokenHill?

BrokenHill is A productionized greedy coordinate gradient (GCG) attack tool for large language models (LLMs). It is categorized as a Agent Tool with 158 GitHub stars.

What programming language is BrokenHill written in?

BrokenHill is primarily written in Python.

How do I install or use BrokenHill?

You can find installation instructions and usage details in the BrokenHill GitHub repository at github.com/BishopFox/BrokenHill. The project has 158 stars and 24 forks, indicating an active community.

What license does BrokenHill use?

BrokenHill is released under the MIT license, making it free to use and modify according to the license terms.

What are the best alternatives to BrokenHill?

The top alternatives to BrokenHill on Agent Skills Hub include claude-forge, excalidraw-diagram-skill, agent-skills. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.

View on GitHub → Browse Agent Tool tools