Self-Hosted LLM Chat API
The 50 best Self-Hosted LLM Chat API AI tools - Free & Paid
Explore 50 AI tools for Self-Hosted LLM Chat API
LLMChat is an AI chat tool that offers a beta version experience with diverse AI models, personalized memory, custom assistant creation, and privacy-focused locally stored conversations. Explore features like plugin integration, tailored preferences, and prompt examples for various tasks.
Free
LM Studio runs open‑source large language models locally on Mac (M‑series), Windows, and Linux, enabling private, offline inference. It offers command‑line and headless deployment, server‑side API, SDKs, a model hub, and LM Link for remote model access.
Free
Llama Tutor is an open‑source AI tutoring platform using Llama 3.1 and Together AI. It creates custom lesson plans and explanations for users across education levels, supports many subjects, and offers real‑time dialogue with adaptive sequencing and instant feedback.
Freemium
Llama is a local AI tool that enables users to create customizable and efficient language models without relying on cloud-based platforms, available for download on macOS, Windows, and Linux.
Free
ColossalChat is a LLaMA‑based chatbot that offers a transparent, open‑source implementation with a basic safety filter. It allows issue reporting and operates under OpenAI Terms, making it suitable for developers and researchers needing straightforward conversational AI.
Freemium
Le Chat is an AI assistant that simplifies tasks from everyday questions to complex projects. It combines powerful AI with access to various data sources for comprehensive answers, offering features like search, code analysis, and custom workflow building.
Freemium
Chatbox - AI Copilot Desktop is a versatile cross-platform (Windows, Mac, Linux) tool featuring multiple LLM models. It combines an ergonomic design, diverse AI model assistance, and local data storage to boost productivity by converting conversations into valuable insights.
Freemium
- $3.99/mo
Chainlit is an open-source framework for building conversational AI applications that supports multimodal interactions, integrates with authentication providers, offers a prompt playground for optimization, and ensures data privacy through a self-hosted platform for managing conversational data.
Free
Uncensored AI delivers a chat platform featuring Claude Opus, Gemini, Grok, and MiniMax M2‑Her. It supports text, audio, image, and code interactions, including image‑to‑video via Image Studio. API beta and usage stats benefit developers, writers, educators, and researchers.
Freemium
MultiAI‑Chat is a Chrome extension that opens separate tabs for multiple LLMs such as ChatGPT, Gemini, Qwen, and Perplexity. It lets users configure accounts per tab, compare outputs side‑by‑side, sync history, and prioritize privacy.
Free
Chatclient lets businesses build ChatGPT‑based chatbots trained on uploaded content. It supports 95+ languages, auto‑retraining, custom personas, and white‑label deployment, and integrates with Slack, WhatsApp, Zapier, APIs, and web/mobile apps.
Freemium
LiveChatAI uses GPT‑4o to transform a knowledge base into a chatbot that auto‑resolves ~70% of tickets. It imports content from sites, PDFs, Notion, YouTube, and Q&A; supports 95 languages; automates booking, payments, and CRM updates; and integrates with WhatsApp, Shopify, WordPress, and Slack.
Freemium
- $39/mo
Open‑source AI code‑review platform that plugs into GitHub, GitLab, Bitbucket, and Azure DevOps at the pull‑request level. Model‑agnostic, it runs custom rule sets, tracks technical debt, and delivers real‑time metrics without storing source code.
Freemium
Voiceflow enables teams to create, test, and deploy AI‑powered conversational agents across chat, voice, phone, and web without coding. Its visual editor, real‑time collaboration, and secure deployment pipelines streamline design, evaluation, and omnichannel rollout.
Free
- $50/mo
LLM SEO Monitor tracks keyword rankings and AI-generated SERP results across ChatGPT, Claude and Gemini, highlights content gaps and ranking opportunities, provides competitor analysis, automated alerts, exportable reports and API integrations for workflow automation.
- $0.5
Code Snippets AI indexes full codebases to deliver contextual insights, auto‑generated comments, and precise snippet recommendations. It tracks LLM usage, supports multi‑model chat, offers role‑based collaboration, and integrates with macOS and Windows via API.
Freemium
- $8/mo
Helploom combines live chat, a shared inbox, and an AI chatbot trained on your knowledge base to automate 24/7 multilingual support. It also offers a searchable help center, analytics, escalation to human agents, and developer integration via a JS widget and REST API.
Freemium
Mtalkz is a cloud communication platform offering bulk SMS, RCS, WhatsApp API, OTP, IVR, email, and chatbot services. It supplies APIs, real‑time analytics, regulatory compliance support, and scalable messaging for businesses of all sizes.
Freemium
- $9.99/mo
LemonChat is an anonymous chat platform that connects users with strangers via text and video. It features interest-based matching, gender preference filters, and continuous moderation, ensuring a safe and engaging chat experience across multiple devices without requiring registration.
Free
Secret Llama is a private browser-based chatbot that stores data locally, ensuring enhanced privacy. It supports offline use after initial model download and functions on Chrome and Edge with GPU support, encouraging community contributions for ongoing improvements.
Free
Create, embed, and share personalized AI chat apps without coding using Dialogly. Seamlessly integrate and share GPT-enabled chat apps, fetch real-time data from external HTTP endpoints, customize app behavior with custom rules, automate tasks with Zapier, and extract textual data from URLs.
Subscription
Z.ai chat is an AI-driven conversational tool that utilizes advanced natural language processing to facilitate interactive dialogue and deep search for applications in tech blogs, coding, and research, with API support for developers and content organization features.
TypingMind unifies ChatGPT, Gemini, Claude, and other LLMs in one interface, enabling parallel chats, project folders, tagging, search, and built‑in tools for documents, images, and code, plus features like agent building, prompt chaining, RAG, voice, canvas, and plugins.
Paid
Forefront lets users chat with PDFs, Word, PowerPoint, CSVs, images, and browse the web via multiple LLMs (GPT‑4, Claude, etc.). It supports custom personas, team sharing, enterprise security, and optional self‑hosting.
Freemium
Clerk Chat unifies voice, SMS, WhatsApp, and RCS, deploying real‑time agents for lead qualification and appointment scheduling. It integrates with Salesforce, Teams, and Genesys, supports HIPAA/SOC 2 compliance, and enables 24/7 automated outreach through embedded widgets.
Paid
- $29/mo
LLM Price Check aggregates LLM API models and provider details into sortable tables and a cost calculator, showing context windows, input/output cost metrics, and quality indicators to help developers and teams evaluate cost–performance tradeoffs.
Freemium
- $1
Devv is an AI coding agent that transforms prompts into complete full‑stack AI websites. It auto‑adds authentication, LLM access, database, and image generation, streamlining build, iteration, and deployment for indie builders and small teams.
Freemium
- $49/mo
Copilot Chat generates code from user‑defined input/output test cases and optional requirements, calling an LLM to produce solutions that satisfy those tests. It refines code on failures and includes a GitHub URL parser for project context, enabling rapid, verified development.
Freemium
LMQL is a Python‑based language that enables modular, constraint‑driven prompts for large language models. It supports nested queries, type‑enforced outputs, and runtime distribution checks while switching between backends such as llama.cpp, OpenAI, and Hugging Face.
Freemium
Chat Summary is an AI tool that integrates with LiveChat, automatically generating concise summaries of chat transcripts. It extracts key points, keywords, and sentiment, providing centralized reports and actionable feedback to improve agent performance.
Free trial
- $1.67/mo
Skcript is an all‑in‑one platform that unifies full‑stack engineering, AI pipelines, and design tools, enabling teams to build, iterate, and support AI‑enabled applications across cloud environments while maintaining privacy controls.
Freemium
BenchLLM evaluates language‑model applications via API or CLI, running JSON/YAML test suites with automated, interactive, or custom strategies. It supports OpenAI, LangChain, and any API, detecting regressions, generating reports, and visualizing results for continuous QA.
Freemium
AIMLAPI.com offers a unified API endpoint for over 400 AI models spanning chat, image, video, audio, voice, text, 3D, and OCR. It supports sandbox testing, granular access control, batch requests, and an OpenClaw runtime for secure, human‑in‑the‑loop workflows.
Freemium
Chatbase builds and deploys AI customer support agents trained on business data. It integrates with CRMs, Zendesk, Slack, and WhatsApp, and offers automated workflows, human escalation, analytics, multilingual support, and security controls (SOC 2, encryption).
Freemium
- $32/mo
Create customized software from natural-language ideas with openbmb/chatdev, an LLM-powered multi-agent collaboration framework.
Freemium
ChatCraft is a web‑based AI coding assistant that authenticates with GitHub or Google, lets users create, edit, and save multiple chat sessions, share URLs, and invoke functions to aid code learning and creativity. It stores past conversations for easy reference.
Freemium
Ava is an open‑source desktop app that runs language models locally using llama.cpp, offering a GUI or headless mode. Built with Zig/C++ and SQLite, it enables rapid prototyping, privacy‑focused experimentation, and straightforward local deployment.
Freemium
Millis AI enables ultra‑low‑latency voice agents (~600 ms response) with no‑code or low‑code tools, supporting inbound/outbound calls in 100+ countries, webhook integration, multiple LLMs, custom voice cloning, and deployment across phone, web, mobile, SDKs, and widgets.
Free
- $9.99/mo
ChatBetter is a unified AI platform that automatically selects and chains the best language models for any query or complex task. It enables side-by-side response comparison and supports team collaboration with enterprise-grade security and project management.
Free trial
- $20/mo