Best AI Agent Skills for Text-to-Speech & Voice

Find AI tools for text-to-speech synthesis, voice cloning, speech recognition, and audio processing.

Top 10 Text-to-Speech & Voice Tools

1 voicemode by mbailey
★ 916 Python MCP Server

Natural (2-way) voice conversations with Claude Code

View Details → GitHub →
2 SpeakItAI by loglux
★ 46 Python Agent Tool

Convert text to speech using Microsoft Azure Neural Text-to-Speech (TTS) and a simple Gradio web interface.

View Details → GitHub →
3 vllm-mlx by waybarrios
★ 639 Python MCP Server

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX backend, 400+ tok/s. Works with Claude Code.

View Details → GitHub →
4 ChatTTS by 2noise
★ 38.9k Python Agent Tool

A generative speech model for daily dialogue.

View Details → GitHub →
5 MiniMax-MCP by MiniMax-AI
★ 1.3k Python MCP Server

Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.

View Details → GitHub →
6 vox by app-vox
★ 22 TypeScript Agent Tool

Your privacy-first voice-to-text tool; Local Whisper transcription with optional LLM enhancement so your audio never leaves your computer 💜

View Details → GitHub →
7 skills by team-telnyx
★ 153 Shell Agent Tool

Official Telnyx skills for AI coding agents

View Details → GitHub →
8 telnyx-skills by team-telnyx
★ 150 Shell Agent Tool

Official Telnyx skills for AI coding agents

View Details → GitHub →
9 unsloth by unslothai
★ 58.5k Python AI Tool

Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.

View Details → GitHub →
10 openwhispr by OpenWhispr
★ 2.1k TypeScript Agent Tool

Voice-to-text dictation app with local (Nvidia Parakeet/Whisper) and cloud models (BYOK). Privacy-first and available cross-platform.

View Details → GitHub →

Comparison

Tool Stars Language License Score
voicemode ★ 916 Python MIT 41
SpeakItAI ★ 46 Python 31
vllm-mlx ★ 639 Python 40
ChatTTS ★ 38.9k Python AGPL-3.0 40
MiniMax-MCP ★ 1.3k Python MIT 44
vox ★ 22 TypeScript 31
skills ★ 153 Shell MIT 38
telnyx-skills ★ 150 Shell MIT 38
unsloth ★ 58.5k Python Apache-2.0 48
openwhispr ★ 2.1k TypeScript MIT 41

Related Categories

Content Writing Translation Summarization

Frequently Asked Questions

What are the best AI tools for text-to-speech & voice?

The top text-to-speech & voice tools include voicemode, SpeakItAI, vllm-mlx. These are ranked by our composite score based on GitHub stars, community activity, and code quality.

Are these text-to-speech & voice tools free to use?

Most tools listed here are open-source. 7 out of 10 have explicit open-source licenses, making them free to use and modify.

How do I choose the right text-to-speech & voice tool?

Consider your tech stack (language compatibility), project scale (stars indicate community trust), and specific features you need. Use the comparison table above to evaluate side by side.

Get Weekly AI Tool Picks

Top 20 fastest-growing AI tools delivered every Monday. Free.

No spam, unsubscribe anytime.

Explore All 25,000+ Skills on Agent Skills Hub