llm_counts — Agent Tool by harleyszhang

by harleyszhang · Agent Tool · ★ 114

Last updated: · Indexed by AgentSkillsHub · Auto-synced every 8h

About llm_counts

llmprofiler llm theoretical performance analysis tools and support params, flops, memory and latency analysis. 主要功能 支持 qwen2.5、qwen3 dense 系列模型。 支持张量并行推理模式。 支持 、、 等硬件以及主流 decoder-only 的自回归模型,可自行在配置文件中增加。 支持分析性能瓶颈,不同 是 还是 ,以及 的性能瓶颈。 支持输出每层和整个模型的参数量、计算量,内存和 。 推理时支持预填充和解码阶段分别计算内存和 latency、以及理论支持的最大 等等。 支持设置计算效率、内存读取效率(不同推理框架可能不一样,这个设置好后,可推测输出实际值)。 推理性能理论分析结果的格式化输出。 如何使用 使用方法,直接调用 文件中函数 函数并输入相关参数即可。 python def llmprofile(modelname="llama-13b", gpuname: str = "v100-sxm-32gb", bytesperparam: int = BYTESFP16, bs: int = 1, seqlen: int = 522, generatelen=1526, dszero: int = 0, dpsize: int = 1, tpsize: int = 1, ppsize: int = 1, spsize: int = 1, layernormdtypebytes: int = BYTESFP16, kvcachebytes: int = BYTESFP16, flopsefficiency: float = FLOPSEFFICIENCY, hbmmemoryefficiency: float = HBMMEMORYEFFICIENCY, intranodememoryefficiency=INTRANODEMEMORYEFFICIENCY, internodememoryefficiency=INTERNODEMEMORYEFFICIENCY, mode: str = "inference", ) - dict: """format print dicts of the total floating-point operations, MACs,

gpu-performancellamallmllm-inferenceprofilerpython3transformer

Quick Facts

Stars114
Forks10
LanguagePython
CategoryAgent Tool
Quality Score38.25/100
Open Issues1
Last Updated2025-07-11
Created2023-07-26
Platformspython
Est. Tokens~505k

Compatible Skills

These tools work well together with llm_counts for enhanced workflows:

  • vllm-cli — semantic(0.23)+complementary+rare_topics+same_lang+similar_pop+shared_platform (58%)
  • Noema-Declarative-AI — semantic(0.16)+complementary+rare_topics+same_lang+similar_pop+shared_platform (55%)
  • ai-agents-reality-check — semantic(0.19)+complementary+same_lang+similar_pop+shared_platform (52%)
  • data-analysis-llm-agent — semantic(0.18)+complementary+same_lang+similar_pop+shared_platform (51%)

llm_counts alternative? Top 6 similar tools

Looking for a llm_counts alternative? If you're comparing llm_counts with other agent tool tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.

  • LLMOne by EM-GeekLab · ⭐ 87

    Enterprise-grade LLM automated deployment tool that makes AI servers truly "plug-and-play".

  • LLM-VM by anarchy-ai · ⭐ 490

    irresponsible innovation. Try now at https://chat.dev/

  • vllm-cli by Chen-zexi · ⭐ 480

    A command-line interface tool for serving LLM using vLLM.

  • WorkflowAI by WorkflowAI · ⭐ 470

    WorkflowAI is an open-source platform where product and engineering teams 
collaborate to build and iterate on

  • edsl by expectedparrot · ⭐ 466

    Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market

  • mlx-llm by riccardomusmeci · ⭐ 459

    Large Language Models (LLMs) applications and tools running on Apple Silicon in real-time with Apple MLX.

More Agent Tool Tools

Explore other popular agent tool tools:

View all Agent Tool tools →

Popular Python Agent Tools

Frequently Asked Questions

What is llm_counts?

llm_counts is llm theoretical performance analysis tools and support params, flops, memory and latency analysis.. It is categorized as a Agent Tool with 114 GitHub stars.

What programming language is llm_counts written in?

llm_counts is primarily written in Python. It covers topics such as gpu-performance, llama, llm.

How do I install or use llm_counts?

You can find installation instructions and usage details in the llm_counts GitHub repository at github.com/harleyszhang/llm_counts. The project has 114 stars and 10 forks, indicating an active community.

What are the best alternatives to llm_counts?

The top alternatives to llm_counts on Agent Skills Hub include LLMOne, LLM-VM, vllm-cli. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.

View on GitHub → Browse Agent Tool tools