by deep-symbolic-mathematics · Agent Tool · ★ 95
Last updated: · Indexed by AgentSkillsHub · Auto-synced every 8h
: Benchmark for Scientific Equation Discovery or Symbolic Regression with LLMs This is the official repository for the paper "LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models" (ICML 2025 Oral) Overview In this paper, we introduce LLM-SRBench, a comprehensive benchmark with $239$ challenging problems across four scientific domains specifically designed to evaluate LLM-based scientific equation discovery methods while preventing trivial memorization. Our benchmark comprises two main categories: LSR-Transform, which transforms common physical models into less common mathematical representations to test reasoning beyond memorized forms, and LSR-Synth, which introduces synthetic, discovery-driven problems requiring data-driven reasoning. Updates 9 June, 2025: 🌟 LLM-SRBench is selected for Oral presentation (top 1%) at ICML 2025 1 May, 2025: 🌟 LLM-SRBench is accepted for Spotlight poster at ICML 2025 16 Apr, 2025: 🌟 LLM-SRBench data and evaluation code i
| Stars | 95 |
| Forks | 11 |
| Language | Python |
| Category | Agent Tool |
| Quality Score | 40.25/100 |
| Open Issues | 4 |
| Last Updated | 2025-07-31 |
| Created | 2025-01-30 |
| Platforms | python |
| Est. Tokens | ~84k |
Looking for a llm-srbench alternative? If you're comparing llm-srbench with other agent tool tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.
Promptdesk is a tool designed for effectively creating, organizing, and evaluating prompts and large language
A Model Context Protocol (MCP) server for ATLAS, a Neo4j-powered task management system for LLM Agents - imple
Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market
LLMs and Machine Learning done easily
Enhance LLM agents with rich tool APIs
An open, curated collection of Agent Skills for scientific research — clone it, use it, extend it!
Explore other popular agent tool tools:
llm-srbench is [ICML2025 Oral] LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models. It is categorized as a Agent Tool with 95 GitHub stars.
llm-srbench is primarily written in Python. It covers topics such as ai4code, ai4math, ai4science.
You can find installation instructions and usage details in the llm-srbench GitHub repository at github.com/deep-symbolic-mathematics/llm-srbench. The project has 95 stars and 11 forks, indicating an active community.
The top alternatives to llm-srbench on Agent Skills Hub include promptdesk, atlas-mcp-server, edsl. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.