llm-srbench — Agent Tool by deep-symbolic-mathematics

by deep-symbolic-mathematics · Agent Tool · ★ 95

Indexed by AgentSkillsHub · Auto-synced every 8h

About llm-srbench

LLM-SRBench: Benchmark for Scientific Equation Discovery or Symbolic Regression with LLMs

This is the official repository for the paper "LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models" (ICML 2025 Oral).

Overview

In this paper, we introduce LLM-SRBench, a comprehensive benchmark with 239 challenging problems across four scientific domains, specifically designed to evaluate LLM-based scientific equation discovery methods while preventing trivial memorization. Our benchmark comprises two main categories:

  • LSR-Transform, which transforms common physical models into less common mathematical representations to test reasoning beyond memorized forms.

  • LSR-Synth, which introduces synthetic, discovery-driven problems requiring data-driven reasoning.

Updates

  • 9 June, 2025: 🌟 LLM-SRBench is selected for Oral presentation (top 1%) at ICML 2025

  • 1 May, 2025: 🌟 LLM-SRBench is accepted for Spotlight poster at ICML 2025

  • 16 Apr, 2025: 🌟 LLM-SRBench data and evaluation code i
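To make the LSR-Transform idea from the overview concrete, here is a minimal sketch of what "transforming a common physical model into a less common representation" could look like. The pendulum equation, the function names, and the mean-squared-error scoring are all illustrative assumptions; none of them is taken from the benchmark's data or evaluation code.

```python
import math
import random

def pendulum_period(length, g):
    """Familiar textbook form: T = 2*pi*sqrt(L/g)."""
    return 2 * math.pi * math.sqrt(length / g)

def transformed_target(length, period):
    """Same law solved for a different variable: g = 4*pi^2*L / T^2."""
    return 4 * math.pi ** 2 * length / period ** 2

def make_dataset(n=50, seed=0):
    """Hypothetical dataset: inputs (L, T), target g."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        length = rng.uniform(0.1, 2.0)
        g = rng.uniform(1.0, 25.0)
        rows.append((length, pendulum_period(length, g), g))
    return rows

def mean_squared_error(candidate, rows):
    """Score a candidate equation g = f(L, T) against the data."""
    return sum((candidate(L, T) - g) ** 2 for L, T, g in rows) / len(rows)

rows = make_dataset()

# A candidate that blindly reuses the memorized square-root shape.
memorized_guess = lambda L, T: 2 * math.pi * math.sqrt(L / T)

mse_correct = mean_squared_error(transformed_target, rows)
mse_memorized = mean_squared_error(memorized_guess, rows)
print(f"transformed form MSE: {mse_correct:.3e}")
print(f"memorized form MSE:   {mse_memorized:.3e}")
```

In this toy setup, a candidate that recalls the well-known form without reasoning about the new target variable fits the data poorly, while the rearranged equation fits it up to floating-point error.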

ai4code · ai4math · ai4science · large-language-models · llm-agent · scientific-discovery

Quick Facts

Stars: 95
Forks: 11
Language: Python
Category: Agent Tool
Quality Score: 40.25/100
Open Issues: 4
Last Updated: 2025-07-31
Created: 2025-01-30
Platforms: python
Est. Tokens: ~84k

llm-srbench alternative? Top 6 similar tools

Looking for an llm-srbench alternative? If you're comparing llm-srbench with other Agent Tool projects, these 6 are the closest alternatives on Agent Skills Hub, ranked by topic overlap, star count, and community traction.

  • promptdesk by promptdesk · ⭐ 96

    Promptdesk is a tool designed for effectively creating, organizing, and evaluating prompts and large language

  • atlas-mcp-server by cyanheads · ⭐ 467

    A Model Context Protocol (MCP) server for ATLAS, a Neo4j-powered task management system for LLM Agents - imple

  • edsl by expectedparrot · ⭐ 466

    Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market

  • sagify by Kenza-AI · ⭐ 442

    LLMs and Machine Learning done easily

  • agentlego by InternLM · ⭐ 403

    Enhance LLM agents with rich tool APIs

  • Awesome-Scientific-Skills by InternScience · ⭐ 396

    An open, curated collection of Agent Skills for scientific research — clone it, use it, extend it!


Frequently Asked Questions

What is llm-srbench?

llm-srbench is the official repository for the ICML 2025 Oral paper "LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models". It is categorized as an Agent Tool and has 95 GitHub stars.

What programming language is llm-srbench written in?

llm-srbench is primarily written in Python. It covers topics such as ai4code, ai4math, and ai4science.

How do I install or use llm-srbench?

You can find installation instructions and usage details in the llm-srbench GitHub repository at github.com/deep-symbolic-mathematics/llm-srbench. The project has 95 stars, 11 forks, and 4 open issues.

What are the best alternatives to llm-srbench?

The top alternatives to llm-srbench on Agent Skills Hub include promptdesk, atlas-mcp-server, and edsl. Compare them side by side by stars, quality score, and community activity.
