web-codegen-scorer — Agent Tool by angular

by angular · Agent Tool · ★ 730

Indexed by AgentSkillsHub · Auto-synced every 8h

About web-codegen-scorer

Web Codegen Scorer is a tool for evaluating the quality of web code generated by Large Language Models (LLMs).

You can use this tool to make evidence-based decisions relating to AI-generated code. For example:

  • 🔄 Iterate on a system prompt to find the most effective instructions for your project.
  • ⚖️ Compare the quality of code produced by different models.
  • 📈 Monitor generated code quality over time as models and agents evolve.

Web Codegen Scorer is different from other code benchmarks in that it focuses specifically on web code and relies primarily on well-established measures of code quality.

Features

  • ⚙️ Configure your evaluations with different models, frameworks, and tools.
  • ✍️ Specify system instructions and add MCP servers.
  • 📋 Use built-in checks for build success, runtime errors, accessibility, security, LLM rating, and coding best practices. (More built-in checks coming soon!)
  • 🔧 Automatically attempt to repair issues detected during code generation.
  • 📊 View and compare results with an intuitive report viewer UI.

benchmarking · codegen · evaluation · llm-coding

Quick Facts

Stars: 730
Forks: 61
Language: TypeScript
Category: Agent Tool
License: MIT
Quality Score: 44.75/100
Open Issues: 13
Last Updated: 2026-05-05
Created: 2025-09-04
Platforms: browser, node
Est. Tokens: ~175k

Compatible Skills

These tools work well together with web-codegen-scorer for enhanced workflows:

  • kubb (55%): semantic (0.15), complementary, rare topics, same language, similar popularity, shared platform
  • mcp (51%): semantic (0.18), complementary, same language, similar popularity, shared platform

web-codegen-scorer alternative? Top 6 similar tools

If you're comparing web-codegen-scorer with other Agent Tool projects, these 6 are the closest alternatives on Agent Skills Hub, ranked by topic overlap, star count, and community traction.

  • bocoel by rentruewang · ⭐ 289

    Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 tim

  • crystal by stravu · ⭐ 3.0k

    (Crystal is now Nimbalyst) Run multiple Codex and Claude Code AI sessions in parallel git worktrees. Test, com

  • inspector by MCPJam · ⭐ 2.0k

    Testing and evaluation platform to chat, inspect, and debug MCP servers, MCP apps, and ChatGPT apps.

  • kubb by kubb-labs · ⭐ 1.7k

    🧡 The meta framework for code generation. Automate OpenAPI to type-safe TypeScript, Zod, and TanStack Query w

  • trpc-agent-go by trpc-group · ⭐ 1.4k

    A Go framework for building production agent systems with graph workflows, tools, memory, A2A, AG-UI, MCP, eva

  • OpenJudge by agentscope-ai · ⭐ 673

    OpenJudge: A Unified Framework for Holistic Evaluation and Quality Rewards


Frequently Asked Questions

What is web-codegen-scorer?

web-codegen-scorer (Web Codegen Scorer) is a tool for evaluating the quality of web code generated by LLMs. It is categorized as an Agent Tool and has 730 GitHub stars.

What programming language is web-codegen-scorer written in?

web-codegen-scorer is primarily written in TypeScript. It covers topics such as benchmarking, codegen, and evaluation.

How do I install or use web-codegen-scorer?

You can find installation instructions and usage details in the web-codegen-scorer GitHub repository at github.com/angular/web-codegen-scorer. The project has 730 stars and 61 forks.

What license does web-codegen-scorer use?

web-codegen-scorer is released under the MIT license, making it free to use and modify according to the license terms.

What are the best alternatives to web-codegen-scorer?

The top alternatives to web-codegen-scorer on Agent Skills Hub include bocoel, crystal, and inspector. Each offers a different approach to the same problem space; compare them side by side by stars, quality score, and community activity.
