MLLM-Tool — LLM Plugin by Chenyu-Wang567

by Chenyu-Wang567 · LLM Plugin · ★ 138

Last updated: · Indexed by AgentSkillsHub · Auto-synced every 8h

About MLLM-Tool

MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning Chenyu Wang, Weixin Luo, Qianyu Chen, Haonan Mai, Jindi Guo, Sixun Dong, Xiaohua (Michael) Xuan, Zhengxin Li, Lin Ma, Shenghua Gao. ShanghaiTech University && Meituan && UniDT This repository hosts the code, data and model weight of MLLM-Tool, the first tool agent MLLM that has the ability to perceive visual- and auditory- input information and recommend appropriate tools for multi-modal instructions. 🎉 News [x] [2024.02.02] 📢📢 We change the permissions for data and checkpoints, no longer need to apply to download them. [x] [2024.01.16] 🚀🚀 Release the code of MLLM-Tool. [x] [2024.01.16] 🔨🧩 Release the ToolMMBench dataset. [x] [2024.01.16] 📢📢 Release the checkpoint of MLLM-Tool in Vicuna-7B, Vicuna-13B, Llama-7B, Llama-13B, Llama2-7B, Llama2-13B, Llama2Chat-7B, Llama2Chat-13B. 👉 TODO [ ] Collect more data and release the v2 dataset. [ ] Update MLLM-Tool in more types & sizes of LLMs. [ ] Empower MLLM-Tool with retrieving open-set tools. [ ] Release Demo and Inter

gpt4llmlmmtool-agent

Quick Facts

Stars138
Forks4
LanguagePython
CategoryLLM Plugin
LicenseMIT
Quality Score47.2/100
Open Issues5
Last Updated2025-10-10
Created2024-01-08
Platformspython
Est. Tokens~193k

Compatible Skills

These tools work well together with MLLM-Tool for enhanced workflows:

  • VisualAgentBench — semantic(0.34)+complementary+same_lang+similar_pop+shared_platform (57%)
  • groundingLMM — semantic(0.18)+complementary+rare_topics+same_lang+similar_pop+shared_platform (56%)
  • VideoGLaMM — semantic(0.18)+complementary+rare_topics+same_lang+similar_pop+shared_platform (56%)

MLLM-Tool alternative? Top 6 similar tools

Looking for a MLLM-Tool alternative? If you're comparing MLLM-Tool with other llm plugin tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.

  • DyLAN by SALT-NLP · ⭐ 196

    Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Opt

  • just-eval by Re-Align · ⭐ 90

    A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.

  • wcgw by rusiaaman · ⭐ 655

    Shell and coding agent on mcp clients

  • vibe by mondaycom · ⭐ 653

    🎨 Vibe Design System - Official monday.com UI resources for application development in React.js

  • chatgpt-copilot by feiskyer · ⭐ 159

    ChatGPT Copilot Extension for Visual Studio Code

  • VideoGLaMM by mbzuai-oryx · ⭐ 97

    [CVPR 2025 🔥]A Large Multimodal Model for Pixel-Level Visual Grounding in Videos

More LLM Plugin Tools

Explore other popular llm plugin tools:

View all LLM Plugin tools →

Popular Python Agent Tools

Frequently Asked Questions

What is MLLM-Tool?

MLLM-Tool is MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning. It is categorized as a LLM Plugin with 138 GitHub stars.

What programming language is MLLM-Tool written in?

MLLM-Tool is primarily written in Python. It covers topics such as gpt4, llm, lmm.

How do I install or use MLLM-Tool?

You can find installation instructions and usage details in the MLLM-Tool GitHub repository at github.com/Chenyu-Wang567/MLLM-Tool. The project has 138 stars and 4 forks, indicating an active community.

What license does MLLM-Tool use?

MLLM-Tool is released under the MIT license, making it free to use and modify according to the license terms.

What are the best alternatives to MLLM-Tool?

The top alternatives to MLLM-Tool on Agent Skills Hub include DyLAN, just-eval, wcgw. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.

View on GitHub → Browse LLM Plugin tools