by jeinlee1991 · Agent Tool · ★ 6.0k
Indexed by AgentSkillsHub · Auto-synced every 8h
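ReLE Evaluation: a continuously updated capability benchmark for Chinese AI large language models. It currently covers 374 models, including commercial models such as chatgpt, gpt-5.4, Google gemini-3.1-pro, Claude-4.6, Wenxin ERNIE-X1.1, ERNIE-5.0, qwen3.6-max, qwen3.6-plus, Baichuan, iFlytek Spark, and SenseTime senseChat, as well as open-source models such as step3.5-flash, kimi-k2.6, ernie4.5, MiniMax-M2.7, deepseek-v4, Qwen3.6, llama4, Zhipu GLM-5.1, MiMo-V2, LongCat, gemma4, and mistral. Besides leaderboards, it provides a large-model defect library with more than 2 million entries, so the community can study, analyze, and improve large models.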
| Metric | Value |
| --- | --- |
| Stars | 6,005 |
| Forks | 242 |
| Category | Agent Tool |
| Quality Score | 31.7/100 |
| Open Issues | 13 |
| Last Updated | 2026-05-12 |
| Created | 2023-06-04 |
| Platforms | claude-code, gemini |
| Est. Tokens | ~2147484k |
Looking for a chinese-llm-benchmark alternative? If you're comparing chinese-llm-benchmark with other agent tools, these 6 projects are the closest alternatives on Agent Skills Hub, ranked by topic overlap, star count, and community traction.
Pocket Flow: 100-line LLM framework. Let Agents build Agents!
Comprehensive resources on Generative AI, including a detailed roadmap, projects, use cases, interview prepara
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai
Agentic-RAG explores advanced Retrieval-Augmented Generation systems enhanced with AI LLM agents.
The LLM Anti-Framework
Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.
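chinese-llm-benchmark is ReLE Evaluation, a continuously updated capability benchmark for Chinese AI large language models. It currently covers 374 models, including commercial models such as chatgpt, gpt-5.4, Google gemini-3.1-pro, Claude-4.6, Wenxin ERNIE-X1.1, ERNIE-5.0, qwen3.6-max, qwen3.6-plus, Baichuan, iFlytek Spark, and SenseTime senseChat, as well as open-source models such as step3.5-flash, kimi-k2.6, ernie4.5, and MiniMax, among others. It is categorized as an Agent Tool with 6.0k GitHub stars.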
You can find installation instructions and usage details in the chinese-llm-benchmark GitHub repository at github.com/jeinlee1991/chinese-llm-benchmark. The project has 6.0k stars and 242 forks, which suggests an active community.
The top alternatives to chinese-llm-benchmark on Agent Skills Hub include PocketFlow, generative-ai, and swarms. Each offers a different approach to the same problem space, so compare them side-by-side by stars, quality score, and community activity.