Find AI tools for text-to-speech synthesis, voice cloning, speech recognition, and audio processing.
Convert text to speech using Microsoft Azure Neural Text-to-Speech (TTS) and a simple Gradio web interface.
OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX backend, 400+ tok/s. Works with Claude Code.
Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.
Your privacy-first voice-to-text tool; Local Whisper transcription with optional LLM enhancement so your audio never leaves your computer 💜
Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.
Voice-to-text dictation app with local (Nvidia Parakeet/Whisper) and cloud models (BYOK). Privacy-first and available cross-platform.
| Tool | Stars | Language | License | Score |
|---|---|---|---|---|
| voicemode | ★ 916 | Python | MIT | 41 |
| SpeakItAI | ★ 46 | Python | — | 31 |
| vllm-mlx | ★ 639 | Python | — | 40 |
| ChatTTS | ★ 38.9k | Python | AGPL-3.0 | 40 |
| MiniMax-MCP | ★ 1.3k | Python | MIT | 44 |
| vox | ★ 22 | TypeScript | — | 31 |
| skills | ★ 153 | Shell | MIT | 38 |
| telnyx-skills | ★ 150 | Shell | MIT | 38 |
| unsloth | ★ 58.5k | Python | Apache-2.0 | 48 |
| openwhispr | ★ 2.1k | TypeScript | MIT | 41 |
The first text-to-speech & voice tools listed are voicemode, SpeakItAI, and vllm-mlx. Every tool carries a composite score based on GitHub stars, community activity, and code quality; by that score, the highest-rated entries in the table are unsloth (48) and MiniMax-MCP (44).
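Most tools listed here are open-source. 7 out of 10 list an explicit open-source license (MIT, Apache-2.0, or AGPL-3.0), so they are free to use and modify under the terms of that license. The other 3 show no license in the table, so check each repository before reusing its code.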
Consider your tech stack (language compatibility), project scale (star count is a rough proxy for community adoption), license, and the specific features you need. Use the comparison table above to evaluate the tools side by side.
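As a rough illustration of that side-by-side evaluation, the sketch below loads the comparison table into Python, filters by language and license, and sorts by score. The `Tool` type and the `shortlist` helper are hypothetical names introduced for this example, and star counts shown as "k" in the table (such as 38.9k) are expanded to rounded integers.

```python
from typing import NamedTuple, Optional


class Tool(NamedTuple):
    name: str
    stars: int               # rounded where the table shows "k" values
    language: str
    license: Optional[str]   # None where the table shows no license
    score: int


# Rows copied from the comparison table above.
TOOLS = [
    Tool("voicemode", 916, "Python", "MIT", 41),
    Tool("SpeakItAI", 46, "Python", None, 31),
    Tool("vllm-mlx", 639, "Python", None, 40),
    Tool("ChatTTS", 38_900, "Python", "AGPL-3.0", 40),
    Tool("MiniMax-MCP", 1_300, "Python", "MIT", 44),
    Tool("vox", 22, "TypeScript", None, 31),
    Tool("skills", 153, "Shell", "MIT", 38),
    Tool("telnyx-skills", 150, "Shell", "MIT", 38),
    Tool("unsloth", 58_500, "Python", "Apache-2.0", 48),
    Tool("openwhispr", 2_100, "TypeScript", "MIT", 41),
]


def shortlist(tools, language=None, require_license=True):
    """Filter by language and license, then sort by score (ties broken by stars)."""
    picked = [
        t for t in tools
        if (language is None or t.language == language)
        and (not require_license or t.license is not None)
    ]
    return sorted(picked, key=lambda t: (-t.score, -t.stars))


if __name__ == "__main__":
    for tool in shortlist(TOOLS, language="Python"):
        print(f"{tool.name:<12} score={tool.score} license={tool.license}")
```

Sorting by score first and stars second is only one possible ordering; weighting features or activity differently would need data that the table does not include.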