Github Voice Recognition

The best 50 Github Voice Recognition AI tools - Free & Paid

Free AI tools 💸 All categories 🎨 Deals ％ For you 👀

Explore 50 AI for Github Voice Recognition

Free Only

Deepgram Voice AI

Deepgram Voice AI offers real‑time and batch speech‑to‑text, text‑to‑speech, and voice‑agent APIs. It delivers low‑latency transcripts, natural‑sounding synthesis, and integrated conversation handling for contact centers, transcription, and podcasts, with cloud, on‑prem, and telephony support.

Text-to-speech

Freemium

Voice.ai

16 3

Voice.ai offers cloud‑and on‑prem AI voice agents for calls, scheduling, and queries, supporting 15+ languages. It provides text‑to‑speech, 10‑second voice cloning, real‑time voice change, noise filtering, and integrates with Salesforce, HubSpot, Zendesk, Slack. APIs and SDKs enable scalable deploym

Voice

Freemium - $5/mo

VoiceBox

3 0

Voicebox is an open-source desktop app for voice cloning and TTS that clones voices from short samples, supports WAV/MP3/FLAC/WEBM and mic capture, multi-voice timeline editing with effects, local or remote GPU inference, Whisper STT, and API integration.

Voice

Free

SpeechGen

22 7

SpeechGen.io converts up to 2 million characters into high‑quality neural‑voice audio across 150 languages with 5,000 models. It allows voice, speed, pitch, volume control, SSML tags, background music, multi‑speaker tagging, downloadable formats, and a REST API.

Text-to-speech

Paid - $4.99

Voicechanger.io

19 5

Voice Changer .io allows uploading or live recording, applying effects such as monster, robot, alien, echo, reverse, slow, fast, and custom pitch, previewing them in real time, and downloading the result as .wav for podcasts, videos, streams, or presentations.

Voice

Subscription

Google Gemini

30 4 3

Gemini is an AI assistant and chatbot provided by google based on Gemini LLM family. It provides access to Google's advanced AI systems with many features and integrations to help you with daily workflows and tasks."

Leading AI Assistants

Freemium - $20

GoSpeech

GoSpeech is an app that uses AI-generated faces for multilingual conversations, enabling users to create personalized videos and foster global communication via avatars while supporting charitable causes.

Language Learning

Freemium

Related topics: 🔍 ai speech recognition 🔍 voice assistant api 🔍 open source voip 🔍 audio processing tools 🔍 text to speech engine 🔍 speech synthesis library

VoiceChanger.im

3 2

VoiceChanger.im converts voice recordings or text into high‑quality audio with AI‑generated effects such as gender conversion and robotic tones. Server‑side processing supports multiple formats, precise parameter control, and downloadable files for podcasts, videos, or social media.

Video editing

Free

Voice Design AI

Free text‑to‑speech platform supporting advanced AI models. Offers real‑time, natural‑sounding voice with emotion, multi‑language, and voice‑cloning. Users adjust pitch, speed, and parameters. API integration for podcasts, audiobooks, assistants, e‑learning, accessibility.

Text-to-speech

Free

Github Copilot Voice

GitHub Next Project: Write code without the keyboard using voice commands with GitHub Copilot.

Code assistant

Free trial

Voicemaker

13 1 1

Voicemaker is a cloud‑based text‑to‑speech platform offering 1,500+ AI voices in 130+ languages. It lets users adjust pitch, speed, pauses, add effects, clone voices with a minute of audio, and export to MP3, WAV, OGG, AAC, or OPUS.

Text-to-Speech

Freemium

FineVoice

19 6

FineVoice is an AI voice studio offering personalized custom voices and professional-grade video voiceovers with diverse AI voices and powerful tools for efficient video creation

Audio

Subscription

AI Voice Detector

2 1

AI Voice Detector identifies AI‑generated speech with up to 99 % accuracy. It analyzes MP3, WAV, OGG, M4A, MP4, MOV files up to 10 min by segmenting audio, applying voice‑activity detection, and deep‑learning scoring. Supports multiple languages, Chrome extension, desktop app, API.

AI detection

Subscription - $24.99

Uberduck

1 0

Uberduck generates synthetic voices, text‑to‑speech, and AI music in 70+ languages. It supports voice conversion, cloning, and singing, with developer APIs and built‑in music creation for narration, branding, and marketing.

Text-To-Speech

Free

Voicemod

16 5

Voicemod provides real‑time voice modulation on Windows and macOS with a virtual microphone, 200+ AI‑generated voices, soundboard, instant 30‑second replay, low‑latency keybinds, Voicelab editing, on‑device AI, and hardware integration for streaming.

Audio & Voice

Freemium

Gotalk

The Ultimate AI Voice Generator by gotalk.ai uses advanced deep learning technology to quickly convert text into natural speech. Craft synthetic voices with human-like nuances effortlessly for tasks like videos, podcasts, and phone greetings.

Audio generation

Free trial

VoiSpark

2 2

VoiSpark is an AI voice generator for text-to-speech and voice cloning, offering 500+ natural voices in 30+ languages. It enables custom emotions, styles, and unique vocal identities, with seamless integration for voiceovers in videos, podcasts, and apps.

Voice

Freemium - $9.9/mo

lovevoice AI

5 0

LoveVoice is a text-to-speech tool that converts text into natural-sounding audio with 300+ AI voices in 70 languages. It offers customizable voice settings and outputs high-quality MP3s for videos, podcasts, and more.

Text-to-speech

Subscription

ElevenLabs

18 3 1

ElevenCreative is an AI tool that generates ultra-realistic speech, videos, music, and sound effects, offering text-to-speech, voice cloning, and a library of pre-recorded voices for creating personalized content for various applications.

Audio generation

Freemium - $5/mo

Fish Speech

18 6

Fish Audio S2 delivers real‑time text‑to‑speech with fine‑grained emotional tags and voice cloning from 15 seconds of audio. Its low‑latency API, SDKs, and multilingual support enable developers to create studio‑quality narration, dialogues, and voice agents.

Text-to-speech

Freemium

VoiceGPT

1 0

VoiceGPT lets Android users chat with ChatGPT via voice, offering hotword activation, multilingual input/output, and unlimited free messaging. It supports OCR for image text extraction, code execution in 70+ languages, and DALL‑E 2 image creation, all within a dark/light theme.

Personal assistant

Free

BlabbyAI

4 2

BlabbyAI is a speech-to-text tool that integrates with over 50,000 websites. It converts your speech into accurately formatted text with automatic punctuation and support for 90+ languages.

Speech-to-text

Freemium

SoundHound AI

SoundHound AI is a conversational voice AI platform that provides voice assistants, developer tools, and enterprise AI agents capable of listening, reasoning, and acting. It enables custom voice experiences across industries like automotive, restaurants, and contact centers, with features including

Voice

Freemium

kikivoice.ai

2 3

KikiVoice is an AI voice cloning tool designed for creators, enabling rapid generation of realistic voice clones from short audio samples. It offers versatile models for various applications, including voiceovers and multilingual content creation.

Voice

Speech-to-Speech

17 3

Resemble AI delivers real‑time voice conversion and cloning from brief samples, supports 149+ languages, lets users edit audio via text, and includes deep‑fake detection, watermarking, and API integration for secure, ethical use.

Voice

Freemium - $0.006

Voiceflow

15 5

Voiceflow enables teams to create, test, and deploy AI‑powered conversational agents across chat, voice, phone, and web without coding. Its visual editor, real‑time collaboration, and secure deployment pipelines streamline design, evaluation, and omnichannel rollout.

Chat

Free - $50/mo

Voxify

4 2

Voxify is an advanced AI voice generator tool that offers customizable voice-overs in multiple languages, accents, emotions, tones, styles, pacing with fast turnaround times, affordable pricing options, and flexible subscription plans.

Text-to-Speech

Freemium - $4.99/mo

Vbee AI Voice

12 10 1

Vbee Aivoice is an AI text-to-speech platform that converts text into natural-sounding audio across multiple languages. It offers various voices, supports voice cloning, and provides MP3/WAV output, ideal for podcasts, e-learning, and audiobooks.

Text-to-speech

Freemium

GPT Reader & Transcriber

gpt-reader.com is a community-driven AI development platform that enables open-source teams to manage projects through proposal voting, AI-assisted code reviews, and integrated CI/CD workflows.

Developer tools

Freemium

Play.ht

19 9

PlayAI turns text into natural‑sounding audio in 42+ languages using 800+ voices. Users adjust pitch, rate, volume, add SSML pronunciations, support multi‑speaker real‑time synthesis, voice cloning, and API integration for chatbots, streaming, IVR, e‑learning.

Text-To-Speech

Free trial - $29/mo

Genspark.ai

23 9 1

Genspark unifies inbox, workflows, and collaboration into one AI workspace, offering a 1‑million‑token context window, voice‑to‑text, auto‑meeting notes, and Chrome extensions for instant summarization and task automation across WhatsApp, Slack, and Teams.

AI Assistant

Freemium

Speechify

21 5

Speechify converts PDFs, DOCX, EPUB, web pages, and more into natural‑sounding audio on iOS, Android, macOS, Windows, and Chrome. It offers an AI assistant that summarizes documents while you listen, supports voice typing, and allows offline access.

Text-To-Speech

Free trial - $29/mo

All Voice Lab

3 1

Voice Lab AI is a text-to-speech and voice cloning tool that generates realistic, expressive voices for audiobooks, voiceovers, and narration. It offers multilingual support, tonal nuance, and robust data security features like encryption and access controls.

Text-to-speech

Freemium - $3/mo

Dubbing AI

12 8 1

Dubbing AI is a free, real-time voice changer tailored for gamers and social media users. It enables transforming your voice to match game characters or anime personas, supporting 40 languages across popular platforms for immersive social experiences.

Voice

Free

Voicemy

Voicemy.ai enables users to create, share, and inspire voice songs using AI. Users can clone voices, train voice models, and convert text to speech, fostering creativity and expression.

Audio generation

The AI Voice Generator

4 2

The AI Voice Generator is a versatile tool that creates lifelike voiceovers in 120+ languages and 800+ voices from text inputs. It supports accents, genders, and celebrity mimicry, ideal for content creators and casual users.

Text-to-speech

Free

Dograh

4 0

Dograh is an open-source VAPI alternative for building self-deployed AI voice agents, offering a no-code drag-and-drop builder, telephony and multilingual (30+) support, voice customization, advanced NLP with intent handling, intelligent human routing, and real-time analytics.

AI Agents

Freemium

WellSaid.io

WellSaid converts scripts into natural speech with 120+ licensed voices, tone/speed/pronunciation controls, and Studio plus API for real-time generation, editing, collaboration and integrations—supporting scalable, consistent voiceovers for e-learning, IVR, apps, and video.

Text-to-speech

Free

VoiceDub

1 0

Voicedub 2.0 is an AI tool featuring a vast collection of AI voices for producing exceptional voice covers. It combines voice cloning and text-to-speech technologies, enabling users to create professional vocals and replace existing song vocals seamlessly. Its intuitive interface and active Discord

Audio generation

Freemium - $2.99

Voicera

Voicera is an AI tool that automatically creates life-like voice dictations of blog articles with one click, supports over 200 languages and dialects, and benefits content creators and brands.

Voice

Freemium

vocalimage.app

Vocal Image is an AI-based coaching app that improves speaking skills through personalized voice assessments and targeted programs for speech recovery, accent reduction, and voice transformation, fostering a supportive community and offering educational content for users.

Coaching

Free

VoiceChanger.video

3 2

Voice Changer is a free AI tool that instantly transforms your voice with over 100 textures and 20 languages. It is suitable for video dubbing, content creation, and educational purposes, featuring a user-friendly interface for quick uploads and real-time previews.

Text-to-speech

Free

GoVoice

Govoice is an innovative AI tool that translates spoken words into text effortlessly. Suitable for small businesses and individual entrepreneurs, it boosts productivity by facilitating diverse content creation through voice input.

Text-to-speech

Free trial - $15.9/mo

Voiser

Voiser offers multilingual text‑to‑speech and speech‑to‑text in 75+ languages, supporting diverse audio/video formats. It provides speaker detection, subtitle editing, voice cloning, avatar lip‑sync, web embed, and API integration for creators and developers.

Text-to-speech

Freemium

Gliglish

1 0

Gliglish is an AI‑powered language learning platform offering voice‑based conversation practice with real‑time pronunciation feedback and contextual translations. Users can adjust speed, choose topics, and access mini‑classes across many languages, supporting mobile and desktop use for individual or

Language Learning

Paid

Jogg AI

11 2

JoggAI generates lifelike avatar videos from text or audio, offering script‑to‑video automation, voice cloning, and batch production. Users can create talking photo, podcast, or URL‑to‑video clips without filming or complex editing.

Advertising

Freemium - $29/mo

PrankGPT

PrankGPT lets users generate voice‑based prank calls by entering a phone number, selecting a voice (Marv, Zephyr, or Google Cloud), and providing a prompt. The client‑side app uses Vocode and produces high‑quality audio for hobbyists and developers.

Fun

Freemium

FakeYou

14 4

FakeYou converts text into spoken audio, supports voice-to-voice synthesis, and offers a Voice Designer for custom AI voices. It enables zero‑shot cloning from a single sample, voice conversion, and integrates with media projects for streamlined content creation.

Text-to-Speech

Subscription - $12/mo

BoldVoice

10 3

Boldvoice is an AI application that enhances American English pronunciation by offering instant feedback and guided lessons. It targets challenging sounds and promotes consistent practice, supporting users worldwide to achieve clear and confident speech.

Language Learning

Free trial

Free Text-To-Speech

2 0

A web‑based Microsoft AI TTS tool offering 330+ neural voices in 129 languages. Users can adjust rate, pitch, pauses, and style for news, scripts, or narration. Works across Chrome, Firefox, Edge, with an API for web integration.

Customer support

Free

Github Voice Recognition

The best 50 Github Voice Recognition AI tools - Free & Paid

Explore 50 AI for Github Voice Recognition

Related topics

Related Topics