Open Source Voice Agents

The best 50 Open Source Voice Agents AI tools - Free & Paid

Explore 50 AI tools for Open Source Voice Agents

Open Voice OS

Open Voice OS is an open-source, community-driven voice AI platform for building customizable assistants across Raspberry Pi, embedded devices, Linux desktops, and Docker. It provides plugin-based STT/TTS, configurable wake words, extensible skills, and privacy-focused self-hosting.

Voice

Free

Voice.ai

Voice.ai offers cloud and on‑prem AI voice agents for calls, scheduling, and queries, supporting 15+ languages. It provides text‑to‑speech, 10‑second voice cloning, real‑time voice changing, and noise filtering, and integrates with Salesforce, HubSpot, Zendesk, and Slack. APIs and SDKs enable scalable deployment.

Voice

Freemium - $5/mo

VoiceBox

Voicebox is an open-source desktop app for voice cloning and TTS that clones voices from short samples, supports WAV/MP3/FLAC/WEBM and mic capture, multi-voice timeline editing with effects, local or remote GPU inference, Whisper STT, and API integration.

Voice

Free

ElevenLabs

ElevenCreative is an AI tool that generates ultra-realistic speech, videos, music, and sound effects, offering text-to-speech, voice cloning, and a library of pre-recorded voices for creating personalized content for various applications.

Audio generation

Freemium - $5/mo

11ai

11ai is a voice assistant built on ElevenLabs Agents that enables voice-driven task management, customer research, ticket updates, and team messaging via integrations with Perplexity, Linear, and Slack. It supports private MCP servers and fast voice cloning across 5,000+ voices.

AI Agents

Freemium

Voice Design AI

Voice Design AI is a free text‑to‑speech platform built on advanced AI models. It offers real‑time, natural‑sounding speech with emotion control, multi‑language support, and voice cloning. Users can adjust pitch, speed, and other parameters, and an API supports integration into podcasts, audiobooks, assistants, e‑learning, and accessibility tools.

Text-to-speech

Free

Free Text-To-Speech

A web‑based Microsoft AI TTS tool offering 330+ neural voices in 129 languages. Users can adjust rate, pitch, pauses, and style for news, scripts, or narration. Works across Chrome, Firefox, Edge, with an API for web integration.

Customer support

Free

Deepseek

DeepSeek-V3 is an advanced AI model offering leading performance among open-source LLMs, enhanced speed, and global language support. It sets new benchmarks for inference speed among open-source models.

Leading AI Assistants

Free

Deepgram Voice AI

Deepgram Voice AI offers real‑time and batch speech‑to‑text, text‑to‑speech, and voice‑agent APIs. It delivers low‑latency transcripts, natural‑sounding synthesis, and integrated conversation handling for contact centers, transcription, and podcasts, with cloud, on‑prem, and telephony support.

Text-to-speech

Freemium

Voiceflow

Voiceflow enables teams to create, test, and deploy AI‑powered conversational agents across chat, voice, phone, and web without coding. Its visual editor, real‑time collaboration, and secure deployment pipelines streamline design, evaluation, and omnichannel rollout.

Chat

Free - $50/mo

Dograh

Dograh is an open-source VAPI alternative for building self-deployed AI voice agents, offering a no-code drag-and-drop builder, telephony and multilingual (30+) support, voice customization, advanced NLP with intent handling, intelligent human routing, and real-time analytics.

AI Agents

Freemium

OpenAI.fm

OpenAI.fm is an interactive text-to-speech demo that lets users explore various voice styles and emotional tones, enhancing storytelling in gaming and multimedia by enabling customizable audio outputs with dynamic pacing and expressive characteristics.

Text-to-speech

Freemium

Claude AI

Claude is an advanced AI assistant designed for a variety of tasks, including code generation, writing, productivity enhancement, and business automation. It is highly adaptable, intelligent, and customizable to meet diverse user needs.

Leading AI Assistants

Freemium - $18/mo

AnySpeech.io

AnySpeech.io is an AI voice studio offering 100+ multilingual, style-controlled voices for content creation. It generates export-ready audio for videos, podcasts, and e-learning to save production time and ensure consistent quality.

Text-to-speech

Free trial - $99/mo

008

Voice AI platform that builds conversational agents in five clicks, automating support, sales, and billing calls. It integrates natively with CRMs and databases for real‑time actions, supports multi‑OS softphones, and records transcriptions for audits.

AI Assistant

Free

VoiSpark

VoiSpark is an AI voice generator for text-to-speech and voice cloning, offering 500+ natural voices in 30+ languages. It enables custom emotions, styles, and unique vocal identities, with seamless integration for voiceovers in videos, podcasts, and apps.

Voice

Freemium - $9.9/mo

Kokoro Web

Kokoro Web is an open-source AI voice generator offering multilingual text-to-speech capabilities with customizable accents. It features user-defined input profiles, self-hosting options, and model quantization for optimized performance, catering to developers and content creators.

Text-to-speech

Free

Voicemaker

Voicemaker is a cloud‑based text‑to‑speech platform offering 1,500+ AI voices in 130+ languages. It lets users adjust pitch, speed, pauses, add effects, clone voices with a minute of audio, and export to MP3, WAV, OGG, AAC, or OPUS.

Text-to-speech

Freemium

SpeechGen

SpeechGen.io converts up to 2 million characters into high‑quality neural‑voice audio across 150 languages with 5,000 models. It allows voice, speed, pitch, volume control, SSML tags, background music, multi‑speaker tagging, downloadable formats, and a REST API.

Text-to-speech

Paid - $4.99

Duck.ai

Duck.ai offers anonymous access to popular AI models. It ensures privacy by keeping conversations untracked and outside AI training data, with seamless model switching.

AI Assistant

Free

kikivoice.ai

KikiVoice is an AI voice cloning tool designed for creators, enabling rapid generation of realistic voice clones from short audio samples. It offers versatile models for various applications, including voiceovers and multilingual content creation.

Voice

SoundHound AI

SoundHound AI is a conversational voice AI platform that provides voice assistants, developer tools, and enterprise AI agents capable of listening, reasoning, and acting. It enables custom voice experiences across industries like automotive, restaurants, and contact centers.

Voice

Freemium

uncensored.com

Uncensored AI delivers a chat platform featuring Claude Opus, Gemini, Grok, and MiniMax M2‑Her. It supports text, audio, image, and code interactions, including image‑to‑video via Image Studio. API beta and usage stats benefit developers, writers, educators, and researchers.

Chat

Freemium

WellSaid.io

WellSaid converts scripts into natural speech with 120+ licensed voices, tone/speed/pronunciation controls, and Studio plus API for real-time generation, editing, collaboration and integrations—supporting scalable, consistent voiceovers for e-learning, IVR, apps, and video.

Text-to-speech

Free

LiveKit

LiveKit is an open-source framework and cloud platform for building and hosting low-latency real-time voice, video and physical AI agents, offering a media server, WebRTC SDKs, TTS/STT and telephony connectors, scalable hosting and programmatic APIs.

Voice

Subscription

ZeroBot

ZeroBot lets users create role‑specific AI agents with custom voice, avatar, and behavior, supporting GPT‑5, Gemini, Claude, Llama, and Qwen. It offers actions, connectors, web search, image generation, and human‑backed verification for secure, versatile use.

Chat

Paid

MiniMax

MiniMax is an AI platform providing text, speech, video and music models for developers and creators — supporting agentic text workflows, real-time speech synthesis and voice cloning, emotion-aware video rendering, and precise vocal/instrument music generation via APIs and SDKs.

AI Agents

Freemium

StarVoiceAi

StarVoice is an AI voice generator that lets users create celebrity‑style vocal clips and clone their own voice. It offers a licensed voice library, daily new characters, multi‑language TTS, and community support.

Audio generation

Free - $9.97

YesChat AI

YesChat.ai unifies chat, music, video, and image generation in a browser platform, offering DeepSeek‑R1, GPT‑4o, and Claude 3.5 Sonnet for conversation, royalty‑free music from text, text‑to‑video, and image creation. It supports multiple languages and customizable bots for research and marketing.

Chat

Subscription

Play.ht

PlayAI turns text into natural‑sounding audio in 42+ languages using 800+ voices. Users can adjust pitch, rate, and volume and add SSML pronunciations. The platform supports multi‑speaker real‑time synthesis, voice cloning, and API integration for chatbots, streaming, IVR, and e‑learning.

Text-to-speech

Free trial - $29/mo

Hume AI

Hume AI offers emotion‑intelligent text‑to‑speech, real‑time speech‑to‑speech, and expressive voice cloning across 100+ languages. Developers use TypeScript, Python, .NET, or Swift SDKs to build voice‑design, stage‑direction, and emotion‑analysis features for content creation.

AI Assistant

Freemium - $3/mo

OpenAssistantGPT

OpenAssistantGPT is an open‑source SaaS that lets you build no‑code AI chatbots for websites. It auto‑crawls a URL, supports GPT‑4/3.5, handles file attachments, SAML/SSO, API calls, and web search, with GitHub source and Vercel SDK.

Chatbot builder

Freemium - $18/mo

Sigma AI

SigmaMind AI builds production voice agents without code, delivering sub‑800 ms latency and real‑time tool orchestration. It integrates with databases, CRMs, and APIs, and supports enterprise features like SOC 2 compliance, encryption, private cloud, and SIP trunking for scalable multichannel support.

Customer support

Freemium

LOVO AI

LOVO converts text to speech using 500+ voices in 100 languages with expressive variants. Its online editor syncs audio, adds subtitles, and supports full video editing. Features include voice cloning from one minute of audio, AI script generation, royalty‑free images, and API integration.

Text-to-speech

Freemium

OpenAgents

OpenAgents is an open-source framework for building and operating scalable, interoperable AI agent networks. It provides tools to launch, connect, and orchestrate agents with live monitoring, enabling collaborative applications and workflows.

AI Agents

Freemium

Msgmate

Open‑Chat is a self‑hostable, decentralized chat platform that supports both proprietary and open‑source AI models. It provides a full chat API, runs LLMs on local GPUs, lowers latency, enhances privacy, and deploys easily on personal or cloud servers.

Chat

Freemium

Collab.com

OneContact unifies voice, chat, WhatsApp, and social media into a single contact‑center interface, offering real‑time agent assistance, bot automation, sentiment analysis, quality monitoring, workforce optimization, and CRM integration for global scalability.

Voice

Free

Uberduck

Uberduck generates synthetic voices, text‑to‑speech, and AI music in 70+ languages. It supports voice conversion, cloning, and singing, with developer APIs and built‑in music creation for narration, branding, and marketing.

Text-to-speech

Free

Free Voice Cloning

aiclonevoicefree.com is a free AI voice cloning tool for generating realistic podcast audio: users upload short audio samples (5-30s), and the tool converts text into cloned speech. It supports multiple formats, cross-language synthesis, and offers pitch/speed adjustments with preview and download options.

Voice

Freemium

Perso Interactive

Perso Interactive is a multimodal AI conversational platform delivering real-time, multilingual speech, vision and gesture interactions across PC, mobile and kiosks, with customizable avatars, TTS/voice cloning, precise lip-sync, automated video dubbing and SDK LLM integrations.

AI Characters

Free

OpenCraft AI

OpenCraft AI is a secure, multi‑model copilot that unifies GPT‑4, Claude, and Gemini. It preserves context across model switches, keeps uploaded files accessible, auto‑formats chats into reports or decks, and generates images with consistent voice tone for streamlined workflows.

Code assistant

Paid

Voicera

Voicera is an AI tool for content creators and brands that automatically creates lifelike voice narrations of blog articles with one click, supporting over 200 languages and dialects.

Voice

Freemium

lovevoice AI

LoveVoice is a text-to-speech tool that converts text into natural-sounding audio with 300+ AI voices in 70 languages. It offers customizable voice settings and outputs high-quality MP3s for videos, podcasts, and more.

Text-to-speech

Subscription

OpenHuman

OpenHuman is an open-source personal AI framework for private, on‑premises deployments and local model execution, providing an agent framework, prompt management, local speech (Whisper/Piper), integrations, Docker/one‑click deployment, and developer tooling.

Personal assistant

Free

Convai

Convai enables developers to create 3D conversational characters that perceive vision, voice, and gestures, integrate with Unity, Unreal, or WebGL, and are enriched via document uploads. It offers multilingual support, realistic animation, and scalable deployment across web, mobile, VR, and AR.

Customer support

Freemium

AudioBot

AudioBot converts written text to natural‑sounding MP3 audio using over 500 AI voices in multiple languages, including diverse Spanish accents. Users can tweak pitch, speed, and tone, making it useful for video, podcasts, and accessibility.

Text-to-speech

Paid

Speak Ai

Speak AI is a language data analysis and research platform offering transcription, data analysis, and sentiment analysis for various types of media.

Data analysis

Free trial

Dubbing AI

Dubbing AI is a free, real-time voice changer tailored for gamers and social media users. It enables transforming your voice to match game characters or anime personas, supporting 40 languages across popular platforms for immersive social experiences.

Voice

Free

Applio

Applio is an open-source AI voice cloning tool featuring over 26,000 models, multi-language support, and cross-platform compatibility. Its user-friendly interface and modular codebase cater to both novice and experienced users interested in advanced audio technology.

Audio

Free

voiceslab

VoicesLab is an AI voice cloning platform that creates realistic, expressive voice replicas for podcasts, audiobooks, and marketing. It supports eight languages, preserves accents, and lets users generate secure voiceovers instantly from text.

Voice

Freemium - $7/mo
