by madroidmaq · Agent Tool · ★ 714
Last updated: · Indexed by AgentSkillsHub · Auto-synced every 8h
MLX Omni Server Local AI inference server optimized for Apple Silicon MLX Omni Server provides dual API compatibility with both OpenAI and Anthropic APIs, enabling seamless local inference on Apple Silicon using the MLX framework. Installation • Quick Start • Documentation • Contributing ✨ Features 🚀 Apple Silicon Optimized - Built on MLX framework for M1/M2/M3/M4 chips 🔌 Dual API Support - Compatible with both OpenAI and Anthropic APIs 🎯 Complete AI Suite - Chat, audio processing, image generation, embeddings ⚡ High Performance - Local inference with hardware acceleration 🔐 Privacy-First - All processing happens locally on your machine 🛠 Drop-in Replacement - Works with existing OpenAI and Anthropic SDKs 🚀 Installation ⚡ Quick Start Start the server: Choose your p
| Stars | 714 |
| Forks | 87 |
| Language | Python |
| Category | Agent Tool |
| License | MIT |
| Quality Score | 42.75/100 |
| Open Issues | 18 |
| Last Updated | 2026-05-09 |
| Created | 2024-11-05 |
| Platforms | cli, python |
| Est. Tokens | ~371k |
These tools work well together with mlx-omni-server for enhanced workflows:
Looking for a mlx-omni-server alternative? If you're comparing mlx-omni-server with other agent tool tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.
🛠 openai function calling tools for JS/TS
OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL,
Schema-Guided Reasoning (SGR) has agentic system design created by neuraldeep community
Labs to explore AI Models, MCP servers, and Agents with the AI Gateway powered by Azure API Management and Mic
Introducing the Assistant Swarm. An extension to the OpenAI Node SDK to automatically delegate work to any ass
gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。
Explore other popular agent tool tools:
mlx-omni-server is MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. It implements OpenAI-compatible API endpoints, enabling seamless. It is categorized as a Agent Tool with 714 GitHub stars.
mlx-omni-server is primarily written in Python. It covers topics such as function-calling, genai, mlx.
You can find installation instructions and usage details in the mlx-omni-server GitHub repository at github.com/madroidmaq/mlx-omni-server. The project has 714 stars and 87 forks, indicating an active community.
mlx-omni-server is released under the MIT license, making it free to use and modify according to the license terms.
The top alternatives to mlx-omni-server on Agent Skills Hub include openai-function-calling-tools, vllm-mlx, sgr-agent-core. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.