auto-evaluator — Agent Tool by rlancemartin

by rlancemartin · Agent Tool · ★ 1.1k

Indexed by AgentSkillsHub · Auto-synced every 8h

About auto-evaluator

Note: See the HuggingFace space for this app: https://huggingface.co/spaces/rlancemartin/auto-evaluator

Note: See the hosted app: https://autoevaluator.langchain.com/

Note: Code for the hosted app is also open source: https://github.com/langchain-ai/auto-evaluator

This is a lightweight evaluation tool for question-answering using Langchain to:

  • Ask the user to input a set of documents of interest
  • Apply an LLM to auto-generate question-answer pairs from these docs
  • Generate a question-answering chain with a specified set of UI-chosen configurations
  • Use the chain to generate a response to each question
  • Use an LLM to score the response relative to the answer
  • Explore scoring across various chain configurations

Run as Streamlit app

Inputs

  • Number of questions to auto-generate (if the user does not supply an eval set)
  • Method for text splitting
  • Chunk size for text splitting
  • Chunk overlap for text splitting
  • Embedding method for chunks
  • Chunk retrieval method
  • Neighbors for retrieval
  • LLM for summarization of retrieved chunks
  • Prompt choice for model self-grading

Blog

https://blog.langchain.dev/auto-eval-of-question-answering-tas
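The workflow described above (split the documents, retrieve chunks for each question, generate a response, grade it against the reference answer) can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the project's actual code: `ChainConfig`, `split_text`, `retrieve`, and `evaluate` are hypothetical names, the retriever uses simple word overlap as a stand-in for embeddings, and the LLM calls are replaced by caller-supplied functions.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class ChainConfig:
    """Hypothetical container for a few of the UI-chosen inputs."""
    chunk_size: int = 200     # characters per chunk
    chunk_overlap: int = 50   # characters shared by consecutive chunks
    num_neighbors: int = 2    # chunks retrieved per question


def split_text(text: str, chunk_size: int, chunk_overlap: int) -> List[str]:
    """Fixed-size character splitter with overlap (one possible splitting method)."""
    if chunk_overlap >= chunk_size:
        raise ValueError("chunk_overlap must be smaller than chunk_size")
    step = chunk_size - chunk_overlap
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]


def retrieve(question: str, chunks: List[str], k: int) -> List[str]:
    """Toy retriever: rank chunks by shared lowercase words (stand-in for embeddings)."""
    q_words = set(question.lower().split())
    ranked = sorted(
        chunks,
        key=lambda c: len(q_words & set(c.lower().split())),
        reverse=True,
    )
    return ranked[:k]


def evaluate(
    text: str,
    eval_set: List[Dict[str, str]],
    config: ChainConfig,
    answer_fn: Callable[[str, List[str]], str],
    grade_fn: Callable[[str, str], bool],
) -> float:
    """Return the fraction of questions whose response the grader accepts.

    answer_fn stands in for the question-answering LLM and grade_fn for the
    grading LLM; both are assumptions made for this sketch.
    """
    if not eval_set:
        return 0.0
    chunks = split_text(text, config.chunk_size, config.chunk_overlap)
    correct = 0
    for pair in eval_set:
        context = retrieve(pair["question"], chunks, config.num_neighbors)
        response = answer_fn(pair["question"], context)
        if grade_fn(response, pair["answer"]):
            correct += 1
    return correct / len(eval_set)
```

Calling `evaluate` once per `ChainConfig` and comparing the returned scores mirrors the "explore scoring across various chain configurations" step; in a real setup the two callables would wrap LLM calls rather than local functions.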

Quick Facts

Stars: 1,089
Forks: 93
Language: Python
Category: Agent Tool
Quality Score: 53.0788166791513/100
Open Issues: 3
Last Updated: 2023-05-10
Created: 2023-04-14
Platforms: python
Est. Tokens: ~2967k

auto-evaluator alternative? Top 6 similar tools

Looking for an auto-evaluator alternative? If you're comparing auto-evaluator with other agent tools, these 6 projects are the closest alternatives on Agent Skills Hub, ranked by topic overlap, star count, and community traction.

  • agent-skills by tech-leads-club · ⭐ 4.8k

    The secure, validated skill registry for professional AI coding agents. Extend Antigravity, Claude Code, Curso

  • raptor by gadievron · ⭐ 3.2k

    Raptor turns Claude Code into a general-purpose AI offensive/defensive security agent. By using Claude.md and

  • skills by GuDaStudio · ⭐ 1.9k

    This repository contains a collection of Agent Skills developed by GudaStudio, enabling seamless collaboration

  • claude-forge by sangrokjung · ⭐ 767

    Supercharge Claude Code with 11 AI agents, 36 commands & 15 skills — the claude-code plugin framework inspired

  • excalidraw-diagram-skill by coleam00 · ⭐ 718

    Skill to give Claude Code (and any coding agent) the ability to generate beautiful and practical Excalidraw di

  • agent-skills by hashicorp · ⭐ 639

    A collection of Agent skills and Claude Code plugins for HashiCorp products.

Frequently Asked Questions

What is auto-evaluator?

auto-evaluator is an evaluation tool for LLM QA chains. It is categorized as an Agent Tool with 1.1k GitHub stars.

What programming language is auto-evaluator written in?

auto-evaluator is primarily written in Python.

How do I install or use auto-evaluator?

You can find installation instructions and usage details in the auto-evaluator GitHub repository at github.com/rlancemartin/auto-evaluator. The project has 1.1k stars and 93 forks.

What are the best alternatives to auto-evaluator?

The top alternatives to auto-evaluator on Agent Skills Hub include agent-skills, raptor, and skills. Compare them side by side by stars, quality score, and community activity.
