Data Lakehouse
The best 50 Data Lakehouse AI tools - Free & Paid
Explore 50 AI for Data Lakehouse
DataCamp provides interactive courses, hands-on projects, and role-based career and skill tracks for data science, ML, and AI. It covers Python, R, SQL, cloud platforms, LLMs, and MLOps, plus team analytics and customizable learning paths.
Freemium
LakeSail is a Rust‑native Spark Connect engine that runs Python workloads at native speed, eliminates JVM overhead, and queries multimodal lakehouse data (PDFs, images, videos, tables) inside your AWS account with zero‑ops elastic compute and built‑in governance.
Freemium
AI and data analytics platform delivering end‑to‑end solutions across multiple sectors. It accelerates experimentation to production, supports data engineering, MLOps, LLMOps, and digital engineering, integrating Databricks, Snowflake, and Google Cloud to shorten insight‑to‑action time and boost eff
Subscription
Basedash lets teams ask plain‑English questions of their data warehouses and SaaS sources, automatically generating validated SQL, executing it, and visualizing results in dashboards. It supports 750+ integrations, enforces SOC 2 compliance, and offers an embedding API for internal products.
Paid
DataHawk aggregates daily SKU‑level data, ad metrics, and profitability signals across Amazon, Walmart, and other e‑commerce channels, delivering real‑time dashboards, AI alerts for KPI shifts, ROAS optimization, and multi‑account BI‑integrated reporting.
Subscription
Hex unifies notebooks, conversational queries, and dashboards in a single workspace. It uses shared semantic context to offer reliable insights from Snowflake, BigQuery, Redshift, and more. Data scientists write code, while business users ask plain‑language questions via Threads or Slack.
Freemium
- $36/mo
Lume automates end‑to‑end integration for software teams, discovering schemas and proposing mappings across ERPs, databases, APIs, and flat files. It generates production‑ready dbt models, SQL, and quality rules deployable to Snowflake or BigQuery, shortening cycles and improving data quality.
Free
Keebo is an AI-powered tool that enhances data storage and management through automated classification, seamless workflow integration, and real-time analysis, enabling efficient organization, collaboration, and informed decision-making for data-driven businesses.
H2O.ai delivers an end‑to‑end AI platform that automates feature engineering, model selection, and explainability through AutoML, offers no‑code LLM training, supports enterprise multi‑model orchestration, and includes MLOps and a feature store, all compliant with strict data security standards.
Free
Airbyte is an open-source data integration platform for building ELT/ETL pipelines with 600+ connectors, real-time replication and reverse ETL, low-code/custom connector development, and deployment options for cloud, private, and enterprise compliance controls.
Free trial
- $10/mo
LearnHouse is an open‑source LMS that lets educators create courses quickly with a block‑based editor. It supports self‑hosting, a REST API, payments, analytics, AI grading, multi‑tenancy, and integrations like YouTube and Google Analytics.
Subscription
Data On Demand consolidates structured, unstructured, and streaming data into a single source of truth, providing machine‑learning‑driven forecasting, anomaly detection, and decision optimization. It offers real‑time dashboards, AI alerts, and predictive models in a secure, collaborative workspace.
Free trial
Polar consolidates Shopify, Amazon, and POS data into a single dashboard, leveraging Snowflake for scalable queries. A semantic layer supplies pre‑built metrics, while AI agents deliver tailored insights. Incrementality tests validate marketing impact, and role‑based permissions control team access.
Free trial
Ocular AI unifies multimodal data from cloud, local, and external sources into a single catalog for search, versioning, and AI‑assisted labeling with human‑in‑the‑loop. It supports RLHF, GPU training pipelines, RESTful search API, and role‑based compliance controls.
Freemium
OpenHouse.ai consolidates sales, marketing, and operations data into a real‑time analytics engine that detects shifts in traffic, buyer behavior, pricing pressure, and sales velocity at the community level, diagnosing drivers and prescribing targeted pricing, incentive, and operational actions.
Subscription
HireLakeAI is an AI‑powered recruitment platform that parses resumes, matches candidates to job descriptions, scores communication skills, and outputs structured lists with standardized formatting. It integrates via API with HRMS/ATS systems to accelerate screening and improve hiring efficiency.
Free
HumanLayer is an open-source IDE and orchestration layer for AI coding agents, managing parallel Claude Code sessions, multiclaude workflows, worktrees and remote workers, with context-engineering tools, session replay, workflow templates and GitHub-integrated code-review automation.
Freemium
DataBrain is an embedded analytics platform that gives product teams and developers interactive dashboards, self‑service reporting, and AI‑powered insights. Its low‑code interface and SDK let users customize visualizations, connect to multiple data sources, and embed analytics into applications.
Subscription
- $999/mo
DreamHouse AI lets users upload a photo of any room or exterior and instantly generates a realistic interior layout in over 20 styles. It auto‑places furniture, decor, lighting, and architectural details, preserving existing structures, and allows natural‑language refinement.
Paid
- $39/mo
Secoda centralizes data cataloging, metadata management, and lineage tracking, offering AI‑driven search, query monitoring, and quality scoring. It provides role‑based access, CI/CD impact analysis, and real‑time observability dashboards to streamline workflows.
Free
DeepSense.ai provides end‑to‑end AI solutions for enterprises, integrating large language models, retrieval‑augmented generation, MLOps, advanced computer‑vision, edge inference, and predictive analytics to deliver scalable, real‑time AI agents, co‑pilots, and maintenance optimization.
Subscription
DataChain is a Python SDK and web platform that offers versioned dataset management and lineage tracking on S3, GCS, and Azure, enabling in‑storage data processing, reproducible pipelines, audit trails, collaboration, and secure compliance with SOC‑2 and GDPR.
Freemium
AiHouse is an AI‑powered platform that creates 2D/3D floor plans and renders detailed virtual houses in seconds. It offers 80 M 3D models, automatic customization, 4K photorealistic images, and integrates with JEGA Cloud for seamless production.
Freemium
- $9.99/mo
Dropbox Dash is an AI-driven search tool unifying data from connected apps & emails. It boosts productivity through smart collections, centralized views, and efficient answers for improved workflow management.
Freemium
LightLayer provides scalable, richly annotated egocentric datasets—synchronized RGB, audio, IMU, and depth—via distributed capture coordination, automated collection workflows, and streamlined annotation pipelines to produce delivery-ready data for embodied AI and robotic perception training.
Freemium
Lytics centralizes customer profiles across data warehouses via Cloud Connect, enabling audience segmentation, personalized email/web experiences, and product recommendation workflows. It supports generative AI insights and integrates with popular cloud services for marketing optimization.
Freemium
Durable turns plain‑English requirements into production‑ready code, automatically generating, testing, and deploying workflows across Salesforce, Snowflake, HubSpot, Google Workspace, and 50+ APIs. One‑click deployment, continuous monitoring, isolated containers, SOC 2 compliance, and audit‑ready s
Subscription
Leanware is a nearshore software development partner offering staff augmentation, AI integration, and custom web/mobile app development. They utilize a proprietary framework and U.S.-aligned teams to deliver efficient, high-quality digital solutions for businesses.
Freemium
LM Studio runs open‑source large language models locally on Mac (M‑series), Windows, and Linux, enabling private, offline inference. It offers command‑line and headless deployment, server‑side API, SDKs, a model hub, and LM Link for remote model access.
Free
Scale AI delivers a full‑stack generative‑AI platform that integrates enterprise data, supports fine‑tuning, RLHF, and model safety evaluation, and enables secure AI agent deployment with compliance‑certified cloud infrastructure for regulated and government use.
Freemium
Data Services by Clickworker provides a crowdsourced platform for data collection, validation, labeling, and categorization, assigning microtasks to a global workforce. It delivers scalable, ISO 27001‑compliant results and transparent workflow tracking for AI training and market research.
Freemium
- $13
Datayaki lets users ask plain‑English questions of spreadsheets, CSVs, and cloud databases (Postgres, Supabase, Snowflake, Firebase) directly in the browser. It keeps data local, offers explainable AI, and supports secure, collaborative dashboards.
Freemium
Unsloth Studio is a no-code web UI enabling local training, running, and exporting of open AI models like Qwen3.5 and NVIDIA Nemotron 3, simplifying experimentation for users without extensive technical expertise.
Free
Lazy Admin" is an AI reporting tool for Salesforce offering real-time responses in human language, data protection, AI search capabilities, customizable reporting, and effortless data visualization. It saves time by providing instant insights and enhancing productivity.
Free trial
Leasecake centralizes lease documents, clauses, and renewal data, automates risk detection and obligation alerts, syncs lease accounting with ASC 842, tracks transactions, and provides portfolio analytics to uncover savings, risks, and expansion opportunities.
Freemium
Tinybird is a data platform for high-throughput streaming ingestion and management of large datasets. It features zero downtime schema migrations, instant SQL APIs, and seamless integration with tools like Kafka and S3, ensuring reliable data operations.
Subscription
Label Studio is an open‑source platform for labeling images, audio, text, video, time‑series, and PDFs. It offers customizable interfaces, pre‑labeling with ML, multi‑project support, API/SDK integration, and quality gates that ensure consistent annotations, with export to CSV or databases.
Freemium
- $10
Echobase is a centralized platform for querying, creating, and analyzing data. It enables real-time collaboration, integrates with document management systems, and supports advanced AI model training, enhancing productivity and workflow within teams.
Free trial
DataSquirrel.ai automates data cleaning, analysis, and visualization for business users, enabling quick chart creation, KPI dashboards, and custom reports without coding. It supports scheduled refreshes, GDPR compliance, and interactive sharing for teams and consultants.
Paid
- $15
FiftyOne is a visual AI platform that centralizes data curation, annotation, and model evaluation across images, video, point clouds, and metadata. It offers interactive slicing, automatic labeling with confidence scoring, role‑based access, versioning, and open‑source integration.
Free
June is an AI‑driven analytics platform for B2B SaaS that lets teams query user data via SQL or natural language. It connects to Salesforce, HubSpot, Attio, and Twilio Segment, auto‑generates reports, shares queries, and meets SOC 2 Type II/GDPR compliance.
Paid
Gamma.AI is a cloud DLP tool integrated with Palo Alto Networks CASB that automatically discovers and classifies data across 150+ SaaS apps with 99.5% accuracy. It offers one‑click deployment, real‑time remediation, and API connectors for SIEM/SOAR integration.
Freemium
DealMachine offers investors a platform to locate off‑market properties nationwide, apply 700+ filters to a 150M+ database, and access owner contact info, automated outreach, and AI‑driven analysis for efficient lead generation and deal closure.
Paid