Local Llm Recall

The best 34 Local Llm Recall AI tools - Free & Paid

For you 👀 All categories 🎨 Free AI tools 💸 AI use cases 🤖

Explore 34 AI for Local Llm Recall

Free Only

liteLLM

LiteLLM is an open‑source gateway that unifies access to 100+ LLMs through a single OpenAI‑compatible API, enabling provider fallback, cost tracking, tag‑based budgeting, guardrails, observability, and on‑prem or cloud deployment with a lightweight SDK.

LLM

Freemium

LLMChat

4 2

LLMChat is an AI chat tool that offers a beta version experience with diverse AI models, personalized memory, custom assistant creation, and privacy-focused locally stored conversations. Explore features like plugin integration, tailored preferences, and prompt examples for various tasks.

Chat

Free

LLM Pulse

LLM Pulse tracks brand visibility and search presence across LLMs (ChatGPT, Perplexity, Google AI), offering prompt tracking and suggestions, citation analysis, visibility scoring and competitor benchmarking, sentiment and response inspection, plus API and reporting exports.

SEO

Free trial

Arena AI

4 0

LLM Arena enables users to compare multiple large language models side-by-side, analyzing features like accuracy and capabilities. It supports up to 10 models, facilitating informed decision-making for researchers and developers in selecting the right LLM for their needs.

LLM

Free

Remind ai

Remind AI is an open‑source memory system that records and indexes digital activity. It stores data from email, messaging, and document editors in a structured graph, enabling users to retrieve past actions and files with natural language via local LLMs.

Memory

Freemium

RunLLM

RunLLM is an AI platform that automates incident investigations by querying observability tools, correlating telemetry, and delivering root-cause analyses. It generates live runbooks and remediation recommendations to accelerate MTTR and create an auditable history of incidents.

Automation

Freemium

Active Recall

16 7

Recall is an AI-driven knowledge management tool that organizes and retrieves information efficiently. It summarizes content, uses an interactive knowledge graph, supports spaced repetition, and ensures cross-platform accessibility while prioritizing user data privacy.

Knowledge base management

Freemium

Related topics: 🔍 llm research assistant 🔍 opensource llm 🔍 llm builder 🔍 falcon llm 🔍 next-generation llm 🔍 llm ops

Pocket LLM

Pocketllm is an AI-powered personal document search engine that allows you to easily search and retrieve information from thousands of pages of PDFs and documents. It offers semantic search capability, fine-tuning search results and summarizing results.

Document assistant

Free trial

MICRO LLM

Micro LLM is a personal AI assistant that enhances productivity by managing tasks, scheduling appointments, and answering questions. It operates on devices like iPads and iPhones, offering offline functionality and an intuitive interface for seamless organization.

LLM

Free

Awan LLM

Awan LLM offers unlimited token generation with Meta Llama 3.1 8B and 70B models, no censorship or caps, supporting persistent AI assistance, autonomous agents, roleplay, data processing, and code completion, hosted on owned GPUs for continuous use.

LLM

Subscription

Inceptionlabs - Mercury coder

Inception Labs' diffusion-based large language models (dLLMs) offer faster, more efficient, and cost-effective text generation than traditional autoregressive models. With built-in error correction, multimodal support, and structured output control, they excel in function calling and complex data ge

LLM

Freemium

LLMStack

3 1

LLMStack is an open‑source platform that lets developers build AI agents and workflows without coding, supports multiple model providers, imports data from web, PDFs, audio, cloud services, and offers a collaborative React UI with granular permissions.

LLM

Freemium

LLMWare.ai

LLMWare AI installs a lightweight client on PCs, providing instant access to 100+ AI models optimized for Intel and Qualcomm hardware. It supports RAG, auto‑tunes weights, runs locally without Wi‑Fi, and offers an admin console for monitoring, scaling, and audit logs.

LLM

Freemium

Countless.dev

0 1

llmarena.ai offers side-by-side LLM comparisons across major providers, showing specs like context window, output capacity, modality and routing options. Filters and role-based categories help developers, ML engineers, product managers and researchers select suitable models.

LLM

Freemium

LLM Price Check

LLM Price Check aggregates LLM API models and provider details into sortable tables and a cost calculator, showing context windows, input/output cost metrics, and quality indicators to help developers and teams evaluate cost–performance tradeoffs.

LLM

Freemium - $1

LastMile AI

0 1

LastMile AI is a platform that perceives, remembers, and reasons from vision, speech, and text using LLMs as CPU and context as RAM. It connects to tools, automates workflows, anticipates needs, and surfaces actionable insights for teams and organizations.

AI Assistant

Freemium

LLMWizard

LLMWizard offers access to multiple AI models like GPT-4o and DALL-E 3, enabling users to automate tasks across coding, legal work, and content creation. The platform supports real-time comparison of AI responses for diverse insights.

LLM

Free trial

Unstract

2 0

Unstract is an open‑source, no‑code platform that automates structured data extraction from unstructured documents using LLMs. It features reusable prompts, Human‑in‑the‑Loop verification, and dual‑LLM hallucination mitigation for secure, compliant use across finance, insurance, and healthcare.

No-code

Freemium

Vllm

1 0 1

VLLM is a high-throughput, memory-efficient inference engine for Large Language Models, enabling faster responses and effective memory management. It supports multi-node configurations for scalability and offers robust documentation for seamless integration into workflows.

Infrastructure tools

Free

Falcon LLM

0 1

Falcon is an open‑source LLM family by the Technology Innovation Institute, spanning 0.09‑180 B parameters. It offers efficient Falcon‑H1 series, Arabic variants, multimodal Falcon‑3, and Falcon‑Mamba 7B, all under permissive licenses.

Development

Free

BenchLLM

BenchLLM evaluates language‑model applications via API or CLI, running JSON/YAML test suites with automated, interactive, or custom strategies. It supports OpenAI, LangChain, and any API, detecting regressions, generating reports, and visualizing results for continuous QA.

Developer tools

Freemium

Lmql

1 0

LMQL is a Python‑based language that enables modular, constraint‑driven prompts for large language models. It supports nested queries, type‑enforced outputs, and runtime distribution checks while switching between backends such as llama.cpp, OpenAI, and Hugging Face.

Code assistant

Freemium

LLMSelector

LLM Selector filters open‑source large language models by use case—chatbots, content, code, summarization, research—while presenting benchmarks, training data, architecture, and deployment details. The interface updates regularly to aid researchers, developers, and product managers in data‑driven mo

LLM

Freemium

Exllama

1 0

exllama is a memory-efficient tool for executing Hugging Face transformers with the LLaMA models using quantized weights, enabling high-performance NLP tasks on modern GPUs while minimizing memory usage and supporting various hardware configurations.

LLM

Free

LLM SEO Report

LLM SEO Report generates detailed SEO analyses for brands by assessing visibility across major AI platforms. It provides actionable recommendations to optimize online presence and adapt to evolving search trends influenced by AI technologies.

SEO

Freemium

LLM Pricing

1 0

LLM Pricing Comparison lets developers and businesses compare token costs, context lengths, and modalities for major large‑language models. An interactive calculator estimates application expenses based on input/output token volumes, helping teams budget AI workloads accurately.

LLM

Freemium

RLAMA

Rlama is a document question-answering tool that supports multiple formats and offers intelligent parsing and local processing. It enables efficient retrieval-augmented generation with features like document chunking and automatic updates, suitable for secure knowledge management.

AI Agents

Subscription

LLM Answer Engine

LLM-answer-engine is an advanced answer engine leveraging Groq, Mixtral, Langchain.JS, Brave Search, Serper API, and OpenAI to provide sources, answers, images, videos, and follow-up questions efficiently. It offers an opensource Perplexity alternative.

LLM

Free

Llongterm

llongterm is a memory-focused AI tool that enhances chatbots and virtual assistants by enabling contextual memory retention. It supports personalized interactions across various applications, including education and customer support, and is compatible with multiple programming languages.

Memory

Subscription

ComicLLM

ComicLLM allows users to easily create and customize comics, offering diverse styles and formats for both storyboards and editorial cartoons. It supports multiple languages and provides options for custom art styles, enhancing creative possibilities.

Art Generation

Freemium

web2llm

Web2llm converts web documents into structured Markdown files, extracting relevant content while omitting extraneous elements. Users can input multiple URLs, and the tool organizes individual files and provides summaries in a dedicated 'docs' folder.

Document assistant

Freemium

Reflection 70B

Reflection 70B is an open‑source 70 B Llama 3.1‑based model that uses real‑time reflection tuning for self‑correction. It outperforms GPT‑4o on MMLU, HumanEval, MATH, IFEval, GSM8K, supporting accurate coding, debugging, and reasoning tasks via API, with a no‑registration web interface.

Code assistant

Freemium - $7.9/mo

Mistral.rs

1 0

Mistral.rs is an efficient, versatile tool for high-speed large language model (LLM) inference, offering multi-device support and extensive quantization options for seamless deployment on diverse hardware setups.

LLM

Free

Little-Coder

1 0 1

little-coder is a Pi-based coding agent for running 5–25 GB local LLMs via llama.cpp or Ollama, offering Python/Node CLIs and TypeScript extensions, reproducible benchmarks, build/serve guides, and tools for local code generation, on-device development, and evaluation.

Code assistant

Free

Local Llm Recall

The best 34 Local Llm Recall AI tools - Free & Paid

Explore 34 AI for Local Llm Recall

Related topics

Related Topics