llm-srbench — Agent Tool by deep-symbolic-mathematics

by deep-symbolic-mathematics · Agent Tool · ★ 95

Last updated: 2025-07-31 · Indexed by AgentSkillsHub · Auto-synced every 8h

About llm-srbench

Benchmark for Scientific Equation Discovery or Symbolic Regression with LLMs

This is the official repository for the paper "LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models" (ICML 2025 Oral).

Overview

In this paper, we introduce LLM-SRBench, a comprehensive benchmark with 239 challenging problems across four scientific domains, specifically designed to evaluate LLM-based scientific equation discovery methods while preventing trivial memorization. Our benchmark comprises two main categories: LSR-Transform, which transforms common physical models into less common mathematical representations to test reasoning beyond memorized forms, and LSR-Synth, which introduces synthetic, discovery-driven problems requiring data-driven reasoning.

Updates

9 June, 2025: 🌟 LLM-SRBench is selected for Oral presentation (top 1%) at ICML 2025
1 May, 2025: 🌟 LLM-SRBench is accepted for Spotlight poster at ICML 2025
16 Apr, 2025: 🌟 LLM-SRBench data and evaluation code i

ai4code ai4math ai4science large-language-models llm-agent scientific-discovery

Quick Facts

Stars	95
Forks	11
Language	Python
Category	Agent Tool
Quality Score	40.25/100
Open Issues	4
Last Updated	2025-07-31
Created	2025-01-30
Platforms	python
Est. Tokens	~84k

llm-srbench alternative? Top 6 similar tools

Looking for an llm-srbench alternative? If you're comparing llm-srbench with other agent tools, these 6 projects are the closest alternatives on Agent Skills Hub, ranked by topic overlap, star count, and community traction.

promptdesk by promptdesk · ⭐ 96
Promptdesk is a tool designed for effectively creating, organizing, and evaluating prompts and large language
atlas-mcp-server by cyanheads · ⭐ 467
A Model Context Protocol (MCP) server for ATLAS, a Neo4j-powered task management system for LLM Agents - imple
edsl by expectedparrot · ⭐ 466
Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market
sagify by Kenza-AI · ⭐ 442
LLMs and Machine Learning done easily
agentlego by InternLM · ⭐ 403
Enhance LLM agents with rich tool APIs
Awesome-Scientific-Skills by InternScience · ⭐ 396
An open, curated collection of Agent Skills for scientific research — clone it, use it, extend it!

Popular Python Agent Tools

TrendRadar ⭐ 59.7k · MCP Server
gpt-researcher ⭐ 27.4k · MCP Server
Scrapling ⭐ 64.6k · MCP Server
serena ⭐ 25.5k · MCP Server
MaxKB ⭐ 21.4k · MCP Server

Frequently Asked Questions

What is llm-srbench?

llm-srbench is the official repository for "LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models" (ICML 2025 Oral). It is categorized as an Agent Tool and has 95 GitHub stars.

What programming language is llm-srbench written in?

llm-srbench is primarily written in Python. It covers topics such as ai4code, ai4math, and ai4science.

How do I install or use llm-srbench?

You can find installation instructions and usage details in the llm-srbench GitHub repository at github.com/deep-symbolic-mathematics/llm-srbench. The project has 95 stars and 11 forks, indicating an active community.

What are the best alternatives to llm-srbench?

The top alternatives to llm-srbench on Agent Skills Hub include promptdesk, atlas-mcp-server, and edsl. Each takes a different approach to LLM tooling; compare them side by side by stars, quality score, and community activity.
