Open Source Data Management
The best 50 Open Source Data Management AI tools - Free & Paid
Explore 50 AI for Open Source Data Management
Open Knowledge Maps is an AI search engine that visualizes scientific literature across disciplines, clustering related papers to reveal topic connections and trends. It supports varied document types, offers high‑quality metadata, multilingual browsing, and open‑source integration.
Freemium
Secoda centralizes data cataloging, metadata management, and lineage tracking, offering AI‑driven search, query monitoring, and quality scoring. It provides role‑based access, CI/CD impact analysis, and real‑time observability dashboards to streamline workflows.
Free
Openkoda is an open‑source insurtech platform providing modular templates for claims, policy, and embedded insurance. It offers AI analytics, automated documents, role‑based access, multi‑tenant clustering, and API hooks for rapid, scalable development without vendor lock‑in.
Freemium
OpenDoc AI is an advanced productivity tool that simplifies data science tasks with customizable automation, ready-made workflows, and plain English queries for instant data insights. Streamline tasks, integrate AI tools effortlessly, and boost data analytics efficiency.
Free trial
Open Apps is an open-source app directory that offers a curated selection of free alternatives to popular software tools, enabling users to find quality open-source solutions across various categories for development and productivity needs.
Free
Dropbox Dash is an AI-driven search tool unifying data from connected apps & emails. It boosts productivity through smart collections, centralized views, and efficient answers for improved workflow management.
Freemium
Sourcetable is an AI‑powered spreadsheet platform that lets users query data in plain English, auto‑generate charts, Python/SQL code, and clean data. Built‑in connectors link to databases and apps, while templates enable quick reporting.
Freemium
- $20/mo
Dawiso is a knowledge management and data governance platform that centralizes data management with AI-powered analytics, data lineage visualization, and a business glossary. It ensures accurate financial reporting, regulatory compliance, and seamless integration with diverse data sources for effici
Freemium
Data On Demand consolidates structured, unstructured, and streaming data into a single source of truth, providing machine‑learning‑driven forecasting, anomaly detection, and decision optimization. It offers real‑time dashboards, AI alerts, and predictive models in a secure, collaborative workspace.
Free trial
Curiosity unifies enterprise data into a knowledge graph, enabling AI‑powered search and assistants across legacy and modern systems. It deploys on‑premises for GDPR compliance, offers fast hybrid search, and reduces response times and error rates.
Subscription
Mevo is an open‑source platform that lets developers and data scientists host and customize their own instances on any OS or cloud. With GitHub‑hosted code, full documentation, and modular architecture, it supports integrations and ensures data privacy and compliance.
Free
Basedash lets teams ask plain‑English questions of their data warehouses and SaaS sources, automatically generating validated SQL, executing it, and visualizing results in dashboards. It supports 750+ integrations, enforces SOC 2 compliance, and offers an embedding API for internal products.
Paid
LightOn Enterprise Search is a secure on‑prem RAG platform that indexes text, images, PDFs, and scanned documents. It offers multimodal retrieval, a production‑ready API, white‑label interface, and compliance‑aware analytics for regulated industries.
Paid
DataCamp provides interactive courses, hands-on projects, and role-based career and skill tracks for data science, ML, and AI. It covers Python, R, SQL, cloud platforms, LLMs, and MLOps, plus team analytics and customizable learning paths.
Freemium
Memento Database is a cross-platform, customizable database application that allows users to manage and organize data without coding. It supports automation, collaboration, and various visualization options, making it suitable for personal and business needs.
Freemium
- $4/mo
Airbyte is an open-source data integration platform for building ELT/ETL pipelines with 600+ connectors, real-time replication and reverse ETL, low-code/custom connector development, and deployment options for cloud, private, and enterprise compliance controls.
Free trial
- $10/mo
OpenCode.ai is an open-source AI coding agent that runs directly in your terminal, IDE, or desktop. It connects to 75+ LLM providers, supports offline use, and enables multi-session collaboration for code review and debugging.
Free
Open SaaS is an open-source framework for building scalable applications with React and Node.js, offering features like pre-configured authentication, payment integrations, TypeScript support, an admin dashboard, and easy deployment without vendor lock-in.
Free
Papermerge DMS is open‑source document management storing, indexing, and searching PDFs, JPEGs, TIFFs. OCR via Tesseract adds selectable text; versioning, tagging, custom metadata, page editing, and a web interface support archivists, legal teams, and small businesses.
Freemium
Athena AI consolidates SQL, NoSQL, and file data for real‑time synchronization with hashing/blockchain integrity and live query preview. It offers 21 visual charts, granular permissions, dashboards, alerts, scheduled reports, and API integration.
Freemium
- $1
AI and data analytics platform delivering end‑to‑end solutions across multiple sectors. It accelerates experimentation to production, supports data engineering, MLOps, LLMOps, and digital engineering, integrating Databricks, Snowflake, and Google Cloud to shorten insight‑to‑action time and boost eff
Subscription
Unopim is an open-source Product Information Management software that centralizes and enriches product data, integrates with e-commerce platforms, enhances digital asset management, and supports customization, making it suitable for businesses to optimize their product information processes.
Free
GoSearch consolidates indexed and non‑indexed data from 100+ apps, letting teams query across email, chat, documents, and private files with AI assistants. It automates routine tasks through custom agents, enforces granular security, and supports multiple LLMs for unified enterprise knowledge.
Freemium
- $20/mo
Tinybird is a data platform for high-throughput streaming ingestion and management of large datasets. It features zero downtime schema migrations, instant SQL APIs, and seamless integration with tools like Kafka and S3, ensuring reliable data operations.
Subscription
Open Notebook is a self-hosted, open-source notebook for private LLM workflows, supporting over 16 AI providers. It enables multi-modal content management, vector search, and contextual chat with full data sovereignty for research and development teams.
Freemium
Julius AI connects spreadsheets, databases, and cloud storage, letting users ask natural‑language questions. It delivers instant charts, tables, and reports, sharable in Slack or on a schedule, and supports no‑code plus R, Python, or SQL workflows, keeps data private.
Free
Zilliz Cloud is a fully managed Milvus-based vector database offering billion-scale similarity search, multi-cloud serverless and distributed clusters, SDKs and APIs, embedding pipelines, AUTOINDEX/Cardinal acceleration, RBAC and observability for RAG, semantic search, and recommender systems.
Freemium
- $7/mo
Grokipedia is an AI-driven knowledge base featuring over 885,279 articles and a user-friendly search function. It offers multiple themes and an intuitive interface to facilitate efficient research across a wide range of topics.
Free
OpenHouse.ai consolidates sales, marketing, and operations data into a real‑time analytics engine that detects shifts in traffic, buyer behavior, pricing pressure, and sales velocity at the community level, diagnosing drivers and prescribing targeted pricing, incentive, and operational actions.
Subscription
Indico Intake and Orchestration Platform automates ingestion, enrichment, and routing of unstructured insurance data—extracting emails, PDFs, SOVs, loss runs, and ACORD forms into structured, validated outputs for underwriting, claims, and policy servicing, with real‑time processing and AI‑driven en
Freemium
Iris.ai unifies enterprise data into secure AI agents, enabling retrieval‑augmented generation workflows. It ingests millions of documents, supplies evaluated answers, and offers real‑time dashboards for governance, cost‑efficient LLM deployment across regulated industries.
Freemium
Hex unifies notebooks, conversational queries, and dashboards in a single workspace. It uses shared semantic context to offer reliable insights from Snowflake, BigQuery, Redshift, and more. Data scientists write code, while business users ask plain‑language questions via Threads or Slack.
Freemium
- $36/mo
Ragie is a rag-as-a-service platform that simplifies data ingestion and indexing for developers. With APIs for popular sources, it supports structured and unstructured data, ensuring timely updates and efficient processing for context-rich AI applications.
Free trial
FreedomGPT unifies access to 400+ AI models, showing side‑by‑side answers for voting and auto‑selection via leaderboard. It keeps privacy safe, runs on Windows/macOS, and is open‑source for community contribution and collaboration.
Free
Gamma.AI is a cloud DLP tool integrated with Palo Alto Networks CASB that automatically discovers and classifies data across 150+ SaaS apps with 99.5% accuracy. It offers one‑click deployment, real‑time remediation, and API connectors for SIEM/SOAR integration.
Freemium
Label Studio is an open‑source platform for labeling images, audio, text, video, time‑series, and PDFs. It offers customizable interfaces, pre‑labeling with ML, multi‑project support, API/SDK integration, and quality gates that ensure consistent annotations, with export to CSV or databases.
Freemium
- $10
Ask Data is an open-source, chat-based tool that enables users to create and manage data pipelines using natural language commands. It simplifies data integration, cleansing, and transformation, making data engineering accessible to both technical and non-technical users.
Free trial
Wrapsody virtualizes documents with persistent IDs, providing end‑to‑end visibility, automated encryption, fine‑grained permissions, and audit trails. It tracks versions and chat logs, uses ML to prune duplicates, offers AI summarization/search/Q&A, and safeguards against ransomware with backup and
Freemium
Peaka consolidates diverse data sources into a single governed layer, enabling real‑time, zero‑ETL querying with natural language and SQL. It offers API‑to‑SQL conversion, cross‑database joins, ready‑made connectors, and SOC 2 compliant governance.
Freemium
- $1/mo
Encord is a data development platform that streamlines data curation, labeling, and model evaluation for AI teams. It supports computer vision and multimodal tasks with advanced user management, customizable workflows, and comprehensive quality metrics.
Subscription
APIPark is an open-source AI gateway and API portal that simplifies AI model management, integration, and deployment, offering unified API formatting, lifecycle management, and secure multi-tenant support for efficient AI usage.
Free
Lens by GitBook is an AI-enhanced internal knowledge base that facilitates Git-like collaboration, deep integrations, and content audits. It promotes effortless contribution, organized management, and real-time teamwork for current documentation.
Freemium
OpenSQL.ai simplifies SQL query generation by converting natural language questions into SQL code, making database interactions accessible for all skill levels. It streamlines data analysis and offers an API for integration, prioritizing user data security.
Free trial
OpenLIT is an open‑source observability platform for large‑language‑model applications, offering distributed tracing, real‑time monitoring, model evaluation, prompt versioning, fleet telemetry, and a zero‑code Kubernetes operator to integrate with major LLM providers and vector databases.
Subscription
- $10/mo
AI agents scan 300,000+ sources—including dark‑web forums and new domains—to deliver real‑time OSINT alerts with context on threat actors, intent, and campaigns. Customizable workflows target phishing, insider risk, or credential leaks, enabling rapid response and fraud reduction.
Freemium