Local Text To Video
The best 50 Local Text To Video AI tools - Free & Paid
Translate.video automates video localization: it transcribes, generates subtitles, and dubs content in 75+ languages using voice cloning from a 50‑second clip. Users can edit captions, export SRT/VTT/MP4, and integrate plugins for Photoshop, Illustrator, and Figma.
Freemium
- $29/mo
HeyGen automatically produces 1080p/4K videos from text, images, or audio, adding voiceovers, subtitles, and brand‑aligned styles. It supports avatar animation, photo‑to‑video, and multilingual translation with lip‑sync, enabling quick, localized visual content for marketing, training, and social media.
Freemium
- $24/mo
TranscribeToText.AI turns audio and video files—up to 10 hours or 5 GB—into accurate text in 100+ languages, supporting MP3, MP4, WAV, OGG, etc. Export as DOCX, PDF, TXT, SRT, VTT or import from URLs, YouTube, Google Drive, Dropbox, or live meetings.
Freemium
Invideo AI transforms text into high-quality, cinematic videos with AI-generated visuals, voiceovers, and subtitles. It offers flexible workflow templates, editing options, and features like AI avatars and voice-cloning for personalized content creation.
Subscription
- $25/mo
AIVideo.com automates video production, creating music videos, lyric visuals, looping clips, and converting audio or images into video. It offers text‑to‑image/video, background removal, matchcut editing, and visual effects, enabling quick, professional media creation.
Freemium
Google Veo 3 generates 8‑second, full‑HD cinematic clips from text prompts with lip‑synced dialogue and ambient audio. It animates still images, adds motion, lighting, perspective shifts, and over 60 visual effects for quick online video prototyping.
Subscription
- $7.9/mo
AI Video Generator by Clipfly transforms text into engaging video frames. Users can add subtitles, stickers, and music, merge clips, and use features such as face swap and voiceover for professional video creation.
Freemium
ShortVideoGen is an efficient text-to-video tool that quickly generates customized videos with audio based on text inputs. Users can easily create engaging videos by specifying frames per second and sound preferences.
Freemium
Textideo is an AI-powered tool that transforms text prompts and images into 1080p videos. It enables control over style and composition to create cohesive multi-shot sequences with special effects.
Subscription
- $8.33/mo
LTX Studio is an AI‑powered web platform that converts text prompts into videos, images, or script‑to‑video outputs. It offers camera keyframing, storyboard creation, AI‑generated assets, and collaborative editing, all within a single desktop‑browser workspace.
Subscription
Vozo AI Video Translator converts video content into 110+ languages with context‑aware translation and automatic transcription. It clones original speaker voices, syncs lip movements, replaces on‑screen text, and offers bilingual subtitles, real‑time editing, and secure enterprise integration.
Subscription
- $25/mo
Viralvideo is an AI platform that transforms text into engaging videos for social media. It features automated scene generation, realistic voiceovers, and scheduling options, streamlining video creation for marketers and creators.
Free trial
Vidful.ai turns text and images into short videos in about a minute, using Kling AI for motion and Luma AI Dream Machine for cinematic camera work. It offers text‑to‑video and image‑to‑video modes, delivering quick, professional clips directly in the browser.
Subscription
- $7.9/mo
Video To Blog converts YouTube links or uploads into ready‑to‑publish blog posts in under a minute, supporting 30+ languages. It formats prose, adds headings, SEO metadata, and embeds, and outputs HTML, Markdown, PDF, or links.
Paid
LuvVoice is a free online text-to-speech tool that converts text into audio using over 200 voices in 70 languages. Users can customize speech rate and pitch, making it suitable for content creation and educational purposes.
Freemium
Videofa.st automates subtitling for short videos, providing accurate captions in 99 languages. It improves accessibility and engagement while maintaining brand aesthetics, with customizable, professional-quality outputs that are compatible with various video formats and easy to integrate into workflows.
Freemium
- $6/mo
TurboScribe is an AI-powered transcription tool offering ultra-fast conversion of audio and video files to text. It supports over 98 languages, handles uploads up to 10 hours long, and features speaker recognition for meetings, interviews, and podcasts.
Freemium
- $10/mo
Make‑A‑Video converts text prompts into short videos, using models trained on image‑text pairs and large video datasets. It can generate single‑shot videos or animate stills by interpolating motion, and offers a variation mode for multiple outputs. All outputs are watermarked and filtered.
Freemium
Videoticle turns YouTube videos into Medium‑style text articles by summarizing key points. Paste a URL, pick a language, and read concise summaries on desktop or via a mobile plugin, saving time for creators, researchers, and students.
Freemium
CaptionCreator automatically transcribes and captions audio/video in over 50 languages, detecting input language and translating to English. It handles noisy and multilingual speech, supporting files up to 2 GB and offering unlimited processing for registered users.
Paid
- $30
Video Transcriber AI instantly converts videos from MP4, YouTube, or Zoom into text. It offers speaker recognition and accuracy modes for transcriptions up to 1 GB, with no sign-up required.
Freemium
Uniscribe is a speech-to-text converter that transcribes audio and video files in 98 languages, offering output formats like TXT, PDF, DOCX, and SRT. It also generates summaries and mind maps and extracts key insights from the transcriptions.
Free trial
- $6/mo
Storykit automatically transforms written content into high‑quality videos across multiple formats and languages. The AI‑powered template and text‑to‑video engines eliminate manual editing, cutting production time by up to 95% and enabling teams to scale video output without expanding staff.
Subscription
AIVideoGenerator.me is an AI video generator based on Luma technologies that swiftly creates realistic videos from text prompts.
Freemium
AI Video Generator allows users to quickly transform images and text into high-quality videos, featuring text-to-video and image-to-video capabilities, AI avatars, and intuitive templates, making it suitable for both personal and commercial video production.
Freemium
- $6.5
LTX Desktop is an open-source AI video production suite using the LTX-2.3 multimodal engine for local text-to-video, image-to-video and audio-to-video generation, combined with a non-linear editor, timeline tools, subtitle/XML interoperability and on-prem model management.
Free
JoggAI generates lifelike avatar videos from text or audio, offering script‑to‑video automation, voice cloning, and batch production. Users can create talking photo, podcast, or URL‑to‑video clips without filming or complex editing.
Freemium
- $29/mo
Vidfly.ai is an AI video generator that creates professional videos from scripts, text, or images using over 50 AI models. It automatically adds realistic voiceovers and subtitles, supports multiple export formats, and requires no editing experience.
Freemium
VideoToPage transcribes audio/video, structures content, and auto‑generates blog posts, SEO articles, social snippets, tutorials, SOPs, and course modules. It extracts themes, shots, and OCR text, supports batch uploads and multiple languages, and publishes directly to WordPress, Notion, Ghost, Shopify, and soci
Paid
Luna AI Video Generator turns text prompts or images into short, realistic videos using transformer models trained on video data. It supports multiple languages, offers real‑time web generation, and scales with GPU resources for designers and educators.
Paid
CinemaFlow AI converts scripts into full videos with one-click automated scene selection and AI cinematography. It offers customizable templates and cinematic styles, advanced editing with real-time previews, adjustable SD–4K rendering, and team collaboration controls.
Subscription
FreeTTS delivers browser‑based AI audio utilities: multilingual text‑to‑speech, accurate speech‑to‑text transcription, vocal isolation, voice enhancement, precise cut/join, and format conversion (MP3, WAV, FLAC, OGG, M4A). All processing is local and files auto‑delete after 12 hours.
Freemium
Flickify converts URLs, scripts, or prompts into narrated videos. It extracts text, chooses voice and style, auto‑matches stock or AI images, and offers a slide editor for publishing. Bulk creation and auto‑pilot support large‑scale production.
Freemium
- $18/mo
Transkriptor converts audio/video files into editable, timestamped transcripts in 100+ languages, auto‑detecting speakers. It extracts summaries, action items, and sentiment, and integrates via Zapier with CRMs and PM tools for automated workflow routing.
Subscription
- $30/mo
OLOCR extracts text from images and PDFs in over 100 languages, including CJK. It runs fully in the browser, keeping documents local, and outputs plain text, Word, or searchable PDFs, with optional AI correction and batch processing.
Freemium
- $3.99/mo
Virbo is an AI video generator that turns text or images into videos using 350+ avatars with multiple voices. It supports 80+ languages, offers script creation, translation, voice‑cloning, cross‑device workflow, and an API for automated production.
Paid
- $19/mo
DreamLux is a versatile AI video generator that transforms text prompts and static images into watermark-free videos using advanced algorithms. It offers customizable templates and styles, enabling users to create unique, professional-quality content effortlessly.
Free trial
- $4.99
Animaker Subtitle Generator auto‑transcribes audio, adds and edits subtitles with a click, supports 20+ animated styles, translates to 100+ languages, allows manual adjustments or .srt/.vtt uploads, and exports videos or subtitle files for broader use.
Free
- $10/mo
VideoTube is an AI video generator that transforms text, images, and video into dynamic, engaging social content with customizable templates, voiceovers, and effects. It enables rapid rendering, seamless editing, and easy sharing across social media platforms for diverse video projects.
Freemium
MagicLight is an AI art generator that creates long, consistent videos from text with multiple visual styles. It supports multilingual voiceovers in 10+ languages and 30+ emotional tones, available on desktop and mobile.
Free trial
Lanta AI is an online platform for creating AI-powered videos from images and text, featuring lifelike avatars, style conversion, and prompt-based editing. It offers fast rendering, high-quality outputs, and tools like batch processing and multi-scene transitions.
Freemium
- $6/mo
Maestra transcribes and translates audio/video into searchable text, subtitles, and dubbed audio across 125+ languages, offering live transcription, subtitle editing, voice cloning/TTS, collaboration tools, content workflows, and APIs for integrations and automated publishing.
Freemium
Rask is an AI-powered localization tool that offers video translation, captioning, subtitling, voice over, and dubbing services in multiple languages, with a 14-day free trial for businesses, content creators, and educators.
Free trial
- $60/mo
Verbalate automates video translation into 230+ languages, providing subtitles, voice cloning, and lip‑sync options. Users edit transcripts, perform back‑translation, and integrate via API, supporting industry terms and optional human verification for accuracy.
Subscription
- $9/mo
Ovi Video Generator creates prompt-driven text-to-video and image-to-video clips with physics-accurate motion, lip-synced and ambient audio, realistic visual effects, and editable MP4 outputs. Generation is fast (30–60 seconds) and supports short iterative clips of up to 10 seconds.
Free trial
- $9/mo
Image to Text Converter uses AI OCR to extract editable text from JPG, PNG, GIF, WEBP, BMP, HEIC, TIFF, and PDF images. It supports over twenty languages, allows drag‑and‑drop and batch processing, and automatically deletes uploads for privacy.
Paid
- $2.99