Best AI Agent Skills for Voice Agents in 2026

Q: Are these voice agents tools free to use?

Most voice agents tools listed are open source under permissive licenses (MIT, Apache 2.0). A handful offer paid managed/cloud versions on top of free self-hosted core. Always check the LICENSE file on each tool's GitHub repository before commercial use — some use AGPL or non-commercial restrictions that may not fit your deployment model.

Real-time voice AI agents — speech-in/speech-out conversational systems with Whisper, ElevenLabs, OpenAI Realtime API, and Anthropic voice tools.

🔍 Browse 30 voice agents tools ⭐ 104.3k total stars 🔄 Refreshed every 8h

⚡

Quick Pick — If you only pick one, go with jarvis ★ 1.4k — A 100% private AI voice assistant that lives on your computer (works offline). T

The Complete Guide to Voice Agents Tools (2026)

What Are Voice Agents Tools?

Voice Agents tools are AI-powered software designed to help developers and teams tackle voice agents-related tasks more efficiently. These tools are typically published as open-source projects on GitHub and can be integrated into existing workflows via MCP (Model Context Protocol), Claude Skills, or standalone agent frameworks. On Agent Skills Hub, we index 30 quality-scored voice agents tools across languages including Python, Rust, TypeScript.

Why Use Voice Agents Tools?

In 2026, the AI agent ecosystem is maturing rapidly. Voice Agents tools can significantly boost development efficiency by automating repetitive tasks, reducing human error, and providing intelligent suggestions. The top 3 tools — jarvis, bolna, adk-rust — have earned an average of 3,477 GitHub stars, reflecting strong community validation. 26 of the listed tools come with clear open-source licenses, ensuring freedom to use and modify.

How to Choose the Best Voice Agents Tool?

When choosing a voice agents tool, consider these factors: 1) Community activity — GitHub stars and recent commit frequency indicate reliability; 2) Integration method — check if it supports MCP, Claude, or your preferred agent framework; 3) Language compatibility — the most common language in this list is Python; 4) Quality score — Agent Skills Hub's composite score evaluates code quality, documentation completeness, and maintenance activity. Our recommendation: start with jarvis — it ranks highest in both star count and quality score.

Top 30 Voice Agents Tools

1 jarvis by isair

★ 1.4k Python MCP Server

A 100% private AI voice assistant that lives on your computer (works offline). Talk naturally as if Jarvis is a third person in the room, and get conversational responses. It remembers everything, knows location and time, can check the web, control Chrome, track nutrition, and more with support for unlimited MCPs / tools without context rot.

View Details → GitHub →

2 bolna by bolna-ai

★ 710 Python Agent Tool

Conversational voice AI agents

View Details → GitHub →

3 adk-rust by zavora-ai

★ 568 Rust Agent Tool

Rust Agent Development Kit (ADK-Rust): Build AI agents in Rust with modular components for models, tools, memory, realtime voice, and more. ADK-Rust is a flexible framework for developing AI agents with simplicity and power. Model-agnostic, deployment-agnostic, optimized for frontier AI models. Includes support for real-time voice agents.

View Details → GitHub →

4 airi by moeru-ai

★ 43.1k TypeScript Codex Skill

💖🧸 Self hosted, you-owned Grok Companion, a container of souls of waifu, cyber livings to bring them into our worlds, wishing to achieve Neuro-sama's altitude. Capable of realtime voice chat, Minecraft, Factorio playing. Web / macOS / Windows supported.

View Details → GitHub →

5 agent-skills by livekit

★ 55 Python Agent Tool

Reusable AI coding agent skills for building voice AI with LiveKit

Quick Start: Skills activate automatically when agents detect relevant tasks (e.g., "build a voice agent", "create a LiveKit agent").

```bash
npx skills add livekit/agent-skills
```

View Details → GitHub →

6 com.niki914.nexus.agentic by Xposed-Modules-Repo

★ 150 MCP Server

Nexus | Take over your Android voice assistant with a custom AI agent — connect any LLM (Claude, GPT, Gemini), use MCP tools, Shell/SSH, and Root/Shizuku automation

View Details → GitHub →

7 Patter by PatterAI

★ 976 Python Codex Skill

Open-source voice-AI SDK. The Vapi/Retell alternative for builders who want to own the stack. Give your AI agent a phone number in 4 lines — Python and TypeScript, MIT licensed, Twilio, Telnyx, and Plivo.

View Details → GitHub →

8 OpenMontage by calesthio

★ 42.0k Python Agent Tool

World's first open-source, agentic video production system. 12 production pipelines, 100+ tools, 700+ agent skill and production-knowledge files. Turn your AI coding assistant into a full video production studio.

View Details → GitHub →

9 OpenVoiceUI by MCERQUA

★ 64 HTML Codex Skill

Voice-powered AI assistant platform — connect any LLM, any TTS, with a live web canvas, music generation, and agent orchestration using openclaw. Install: npx openvoiceui setup

View Details → GitHub →

10 openai-agents-js by openai

★ 3.5k TypeScript Agent Tool

A lightweight, powerful framework for multi-agent workflows and voice agents

View Details → GitHub →

11 claude-code-video-toolkit by digitalsamba

★ 1.7k Python Claude Skill

AI-native video production toolkit for Claude Code

View Details → GitHub →

12 elevenlabs-mcp by elevenlabs

★ 1.5k Python MCP Server

The official ElevenLabs MCP server

View Details → GitHub →

13 sag by steipete

★ 574 Go AI Tool

Like the macOS say command, but with a modern voice.

Quick Start: Homebrew (macOS): Go toolchain: Requires Go 1.24+.

```bash
brew install steipete/tap/sag  # auto-taps steipete/tap
```

View Details → GitHub →

14 UnrealGenAISupport by prajwalshettydev

★ 563 C++ MCP Server

Unreal Engine plugin for LLM/GenAI models & MCP UE5 server. OpenAI GPT-5, Deepseek R1, Claude Opus/Sonnet, Gemini 3, Grok 4, Alibaba Qwen, Kimi, ElevenLabs TTS, Inworld, OpenRouter, Groq, GLM, Ollama, Local, Meshy, Tripo, Hunyuan3D, Rodin, fal, Dashscope, Seedream. NPC AI, agentic, chat, 3D gen, TTS, multimodal, image gen. UnrealMCP/UnrealClaude

View Details → GitHub →

15 ai-skills by sanjay3290

★ 335 Python MCP Server

24 cross-platform agent skills for Claude Code, Cursor, Codex & Gemini CLI — databases, messaging, research, TTS, DevOps, and Google Workspace

View Details → GitHub →

16 sdk by vargHQ

★ 330 TypeScript Claude Skill

AI video generation SDK — JSX for videos. One API for Kling, Flux, ElevenLabs, Veed. Built on Vercel AI SDK.

View Details → GitHub →

17 llm by graniet

★ 352 Rust Agent Tool

A powerful Rust library and CLI tool to unify and orchestrate multiple LLM, Agent and voice backends (OpenAI, Claude, Gemini, Ollama, ElevenLabs...) with a single, extensible API. Build, chain, evaluate, and serve complex multi-step AI workflows — including speech-to-text, text-to-speech, completions, vision, and reasoning.

View Details → GitHub →

18 LocalText2Voice by estebanstifli

★ 143 Python Agent Tool

A complete local production workflow for clean narration, structured learning content, and podcast-ready audio

View Details → GitHub →

19 ElevenLabsKit by steipete

★ 111 Swift AI Tool

Swift SDK to stream ElevenLabs Voices

Quick Start:

```swift
import ElevenLabsKit

let client = ElevenLabsTTSClient(apiKey: "<api-key>")
let request = ElevenLabsTTSRequest(
    text: "Hello",
    modelId: "eleven_v3",
    outputFormat: "pcm_44100")

let stream = client.streamSynthesize(voiceId: "<voice-id>", request: request)
let sampleRate = TalkTTS

View Details → GitHub →

20 ralphy by alecs5am

★ 110 TypeScript MCP Server

🎬 Give AI agents tools to create viral videos. Influence at scale, from your terminal.

View Details → GitHub →

21 podcast-llm by evandempsey

★ 142 Python Agent Tool

Automatically generate engaging AI podcasts from nothing but an episode title.

View Details → GitHub →

22 openclaw-voice-call-realtime by TristanBrotherton

★ 57 TypeScript Codex Skill

Give your AI assistant a phone — OpenClaw plugin for real phone calls via Twilio + OpenAI Realtime, with in-call tools, transcripts, and call screening

View Details → GitHub →

23 mcp-tts by blacktop

★ 59 Go MCP Server

MCP Server for Text to Speech

View Details → GitHub →

24 OpenReels by tsensei

★ 64 TypeScript Agent Tool

Open-source AI pipeline that turns any topic into a publish-ready YouTube/Instagram/TikTok Short — research, script, voiceover, visuals, music, captions, and assembly in one command.

View Details → GitHub →

25 CyberVerse by Lynpoint

★ 1.5k Python Agent Tool

Self hosted, real-time digital human agent platform. Build voice-first AI agents with WebRTC, persona memory, tools, RAG, and optional digital-human video.

View Details → GitHub →

26 voicemode by mbailey

★ 1.3k Python MCP Server

Natural voice conversations with Claude Code

Quick Start: Requirements: Computer with microphone and speakers Option 1: Claude Code Plugin (Recommended) The fastest way for Claude Code users to get started: b...

View Details → GitHub →

27 CyberVerse by dsd2077

★ 1.1k Python Agent Tool

Self hosted, real-time digital human agent platform. Build voice-first AI agents with WebRTC, persona memory, tools, RAG, and optional digital-human video.

View Details → GitHub →

28 openclaw-nerve by daggerhashimoto

★ 821 TypeScript Codex Skill

Real-time web cockpit for OpenClaw: voice conversations, agent automated kanban board, workspace/file control, sub-agent sessions, inline charts, and usage visibility.

View Details → GitHub →

29 LLM-Agents-Ecosystem-Handbook by oxbshw

★ 530 Python MCP Server

One-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation tools.

View Details → GitHub →

30 tongflow by tong-io

★ 607 TypeScript Agent Tool

TongFlow : An Open-Source Multi-Modal GenAI Workflow Studio

View Details → GitHub →

Comparison

Tool	Stars	Language	License	Score
jarvis	★ 1.4k	Python	—	67
bolna	★ 710	Python	MIT	66
adk-rust	★ 568	Rust	—	67
airi	★ 43.1k	TypeScript	MIT	74
agent-skills	★ 55	Python	MIT	62
com.niki914.nexus.agentic	★ 150	—	—	61
Patter	★ 976	Python	MIT	74
OpenMontage	★ 42.0k	Python	AGPL-3.0	82
OpenVoiceUI	★ 64	HTML	MIT	61
openai-agents-js	★ 3.5k	TypeScript	MIT	72
claude-code-video-toolkit	★ 1.7k	Python	MIT	76
elevenlabs-mcp	★ 1.5k	Python	MIT	76
sag	★ 574	Go	MIT	74
UnrealGenAISupport	★ 563	C++	MIT	58
ai-skills	★ 335	Python	Apache-2.0	70
sdk	★ 330	TypeScript	Apache-2.0	67
llm	★ 352	Rust	MIT	54
LocalText2Voice	★ 143	Python	MIT	70
ElevenLabsKit	★ 111	Swift	MIT	70
ralphy	★ 110	TypeScript	Apache-2.0	69
podcast-llm	★ 142	Python	—	37
openclaw-voice-call-realtime	★ 57	TypeScript	MIT	70
mcp-tts	★ 59	Go	MIT	53
OpenReels	★ 64	TypeScript	MIT	48
CyberVerse	★ 1.5k	Python	GPL-3.0	69
voicemode	★ 1.3k	Python	MIT	72
CyberVerse	★ 1.1k	Python	GPL-3.0	67
openclaw-nerve	★ 821	TypeScript	MIT	62
LLM-Agents-Ecosystem-Handbook	★ 530	Python	MIT	78
tongflow	★ 607	TypeScript	AGPL-3.0	69

Related Categories

Frequently Asked Questions

What are the best voice agents tools in 2026?

The top voice agents tools in 2026 are jarvis, bolna, adk-rust. Agent Skills Hub ranks 30 options by GitHub stars, quality score (6 dimensions including completeness, examples, and agent readiness), and recent activity. The list is rebuilt every 8 hours from live GitHub data.

How do I choose between jarvis and bolna?

jarvis (1.4k stars) is the most adopted choice for general voice agents workflows, written in Python. bolna (710 stars) is a strong alternative. Pick by your existing stack: match the language and runtime your team already uses to minimize integration cost. If unsure, start with jarvis — it has the deepest community and the most examples online.

When should I NOT use a voice agents tool?

Avoid pre-built voice agents tools when (1) your use case requires deep customization that the tool's plugin system doesn't support, (2) you have strict compliance requirements that ban third-party dependencies, (3) the tool's maintenance is inactive (last commit >6 months ago), or (4) your data volume is small enough that a 50-line custom script is cheaper than learning the tool. For most production workflows above 100 requests/day, the time savings from a maintained tool outweigh the customization loss.

What's the difference between voice agents and text-to-speech & voice?

Voice Agents focuses specifically on real-time voice ai agents — speech-in/speech-out conversational systems with whisper, elevenlabs, openai realtime api, and anthropic voice tools. Text-to-Speech & Voice is a related but distinct category — see https://agentskillshub.top/best/text-to-speech/ for those tools. The two often appear in the same agent pipeline but solve different problems: choose voice agents when your primary goal is the specific task, and text-to-speech & voice when the workflow is broader.

Is jarvis better than building it yourself?

For most teams, yes. jarvis has 1.4k stars worth of community testing, handles edge cases you haven't thought of, and ships with documentation. Build your own only when (1) your requirements are deeply non-standard, (2) you have a security/compliance reason to avoid OSS dependencies, or (3) the maintenance burden is small enough (<200 lines of code) that you'll save time long-term. The break-even point is usually around 2-3 weeks of dev time saved.

Are these voice agents tools free to use?

Most voice agents tools listed are open source under permissive licenses (MIT, Apache 2.0). A handful offer paid managed/cloud versions on top of free self-hosted core. Always check the LICENSE file on each tool's GitHub repository before commercial use — some use AGPL or non-commercial restrictions that may not fit your deployment model.

Best AI Agent Skills for Voice Agents in 2026

The Complete Guide to Voice Agents Tools (2026)

What Are Voice Agents Tools?

Why Use Voice Agents Tools?

How to Choose the Best Voice Agents Tool?

Top 30 Voice Agents Tools

Comparison

Related Categories

Frequently Asked Questions

Get Weekly AI Tool Picks