Open Source Data Science
The 50 best Open Source Data Science AI tools - Free & Paid
Explore 50 AI tools for Open Source Data Science
Open Knowledge Maps is an AI search engine that visualizes scientific literature across disciplines, clustering related papers to reveal topic connections and trends. It supports varied document types, offers high‑quality metadata, multilingual browsing, and open‑source integration.
Freemium
DataCamp provides interactive courses, hands-on projects, and role-based career and skill tracks for data science, ML, and AI. It covers Python, R, SQL, cloud platforms, LLMs, and MLOps, plus team analytics and customizable learning paths.
Freemium
OpenDoc AI is an advanced productivity tool that simplifies data science tasks with customizable automation, ready-made workflows, and plain English queries for instant data insights. Streamline tasks, integrate AI tools effortlessly, and boost data analytics efficiency.
Free trial
PandasAI is an open-source tool for conversational data analysis that allows users to query data in natural language. It integrates various data sources, provides real-time insights, and generates detailed reports and visualizations for effective decision-making.
Subscription
Sourcetable is an AI‑powered spreadsheet platform that lets users query data in plain English, auto‑generate charts, Python/SQL code, and clean data. Built‑in connectors link to databases and apps, while templates enable quick reporting.
Freemium
- $20/mo
OpenCode.ai is an open-source AI coding agent that runs directly in your terminal, IDE, or desktop. It connects to 75+ LLM providers, supports offline use, and enables multi-session collaboration for code review and debugging.
Free
Kanaries transforms raw data into interactive visual insights with AI‑assisted code completion for Pandas, RStudio, and Jupyter. Drag‑and‑drop chart building, natural‑language chat, real‑time collaboration, and offline desktop support streamline the entire exploration workflow across web and desktop.
Subscription
Open Apps is an open-source app directory that offers a curated selection of free alternatives to popular software tools, enabling users to find quality open-source solutions across various categories for development and productivity needs.
Free
Mevo is an open‑source platform that lets developers and data scientists host and customize their own instances on any OS or cloud. With GitHub‑hosted code, full documentation, and modular architecture, it supports integrations and ensures data privacy and compliance.
Free
Globe Explorer is an AI-driven platform for data analysis and trend identification, offering robust topic discovery, visual data representations, insightful reports, and collaborative features to enhance research for educators, researchers, and content creators.
Freemium
Julius AI connects spreadsheets, databases, and cloud storage, letting users ask natural‑language questions. It delivers instant charts, tables, and reports that can be shared in Slack or sent on a schedule, and it supports no‑code as well as R, Python, or SQL workflows while keeping data private.
Free
AI and data analytics platform delivering end‑to‑end solutions across multiple sectors. It accelerates experimentation to production, supports data engineering, MLOps, LLMOps, and digital engineering, integrating Databricks, Snowflake, and Google Cloud to shorten insight‑to‑action time and boost eff
Subscription
Hex unifies notebooks, conversational queries, and dashboards in a single workspace. It uses shared semantic context to offer reliable insights from Snowflake, BigQuery, Redshift, and more. Data scientists write code, while business users ask plain‑language questions via Threads or Slack.
Freemium
- $36/mo
Data On Demand consolidates structured, unstructured, and streaming data into a single source of truth, providing machine‑learning‑driven forecasting, anomaly detection, and decision optimization. It offers real‑time dashboards, AI alerts, and predictive models in a secure, collaborative workspace.
Free trial
OpenHouse.ai consolidates sales, marketing, and operations data into a real‑time analytics engine that detects shifts in traffic, buyer behavior, pricing pressure, and sales velocity at the community level, diagnosing drivers and prescribing targeted pricing, incentive, and operational actions.
Subscription
Data Science Jobs is a specialized job board that connects professionals with opportunities in AI, data science, machine learning, and related fields. It offers user-friendly filters for location, industry, and specialization, making job searches efficient and targeted.
Free
Secoda centralizes data cataloging, metadata management, and lineage tracking, offering AI‑driven search, query monitoring, and quality scoring. It provides role‑based access, CI/CD impact analysis, and real‑time observability dashboards to streamline workflows.
Free
Quadratic is an AI‑enabled spreadsheet that connects to CSV, Excel, PDFs, and databases like Postgres and Snowflake. It lets users filter, clean, and analyze data in a grid, ask natural‑language queries, and generate editable Python and SQL code for visualizations.
Subscription
Openkoda is an open‑source insurtech platform providing modular templates for claims, policy, and embedded insurance. It offers AI analytics, automated documents, role‑based access, multi‑tenant clustering, and API hooks for rapid, scalable development without vendor lock‑in.
Freemium
Airbyte is an open-source data integration platform for building ELT/ETL pipelines with 600+ connectors, real-time replication and reverse ETL, low-code/custom connector development, and deployment options for cloud, private, and enterprise compliance controls.
Free trial
- $10/mo
FreedomGPT unifies access to 400+ AI models, showing side‑by‑side answers for voting and auto‑selection via a leaderboard. It protects user privacy, runs on Windows and macOS, and is open‑source for community contribution and collaboration.
Free
Learn AI, ML, and data science through free tutorials, live coding playgrounds, and 100+ hands‑on projects. The curriculum covers core machine learning, regression, and deep learning, with specialized projects and a 3,958‑question quiz to reinforce knowledge.
Free
Ask Data is an open-source, chat-based tool that enables users to create and manage data pipelines using natural language commands. It simplifies data integration, cleansing, and transformation, making data engineering accessible to both technical and non-technical users.
Free trial
DataSquirrel.ai automates data cleaning, analysis, and visualization for business users, enabling quick chart creation, KPI dashboards, and custom reports without coding. It supports scheduled refreshes, GDPR compliance, and interactive sharing for teams and consultants.
Paid
- $15
Scoop Analytics is an AI-powered platform that analyzes CRM, marketing, and sales data in real-time. It offers customizable reports and dashboards, helping business teams monitor key metrics and make data-driven decisions effectively.
Free trial
Emergent Mind collects recent arXiv papers and categorizes them by topic or author. It offers concise summaries, in‑depth analyses, whiteboard and video renderings, and community‑driven email digests, helping researchers, students, educators, and industry professionals locate and explain literature quickly.
Freemium
Basedash lets teams ask plain‑English questions of their data warehouses and SaaS sources, automatically generating validated SQL, executing it, and visualizing results in dashboards. It supports 750+ integrations, enforces SOC 2 compliance, and offers an embedding API for internal products.
Paid
Open Notebook is a self-hosted, open-source notebook for private LLM workflows, supporting over 16 AI providers. It enables multi-modal content management, vector search, and contextual chat with full data sovereignty for research and development teams.
Freemium
Open‑source AI code‑review platform that plugs into GitHub, GitLab, Bitbucket, and Azure DevOps at the pull‑request level. Model‑agnostic, it runs custom rule sets, tracks technical debt, and delivers real‑time metrics without storing source code.
Freemium
AI agents scan 300,000+ sources—including dark‑web forums and new domains—to deliver real‑time OSINT alerts with context on threat actors, intent, and campaigns. Customizable workflows target phishing, insider risk, or credential leaks, enabling rapid response and fraud reduction.
Freemium
FiftyOne is a visual AI platform that centralizes data curation, annotation, and model evaluation across images, video, point clouds, and metadata. It offers interactive slicing, automatic labeling with confidence scoring, role‑based access, versioning, and open‑source integration.
Free
Storytell.ai converts messy data into clear narratives using 945 prompts. It accepts files, images, audio, and URLs, and augments insights with news, social media, and research. Aimed at data scientists, marketers, and analysts, it complies with SOC2, GDPR, and HIPAA.
Freemium
- $20/mo
HeyScience assists students, tutors, and researchers by gathering sources, outlining essays, and generating drafts. It tracks assignment progress, highlights key concepts, offers citation suggestions, and integrates with LMS for reference import. Users refine drafts with AI feedback.
Free
Arcwise monitors business data for early metric shifts, integrating with Snowflake, BigQuery, Databricks and BI tools. It provides traceable, explainable AI insights and decision‑ready dashboards, preserves institutional knowledge, and meets enterprise security and compliance standards.
Free
Weights & Biases is an AI developer platform that simplifies machine learning experiments with tools for tracking, visualizing, and optimizing models. It enhances workflow efficiency through interactive visualizations and collaboration features.
Freemium
OpenDeepResearcher is an AI-powered research tool that streamlines information gathering by refining search queries, filtering duplicates, and generating comprehensive reports. It features asynchronous processing and a user-friendly Gradio interface for efficient research across various topics.
Subscription
- $19/mo
An AI-powered code-writing assistant that comprehends data content available on GitHub.
Free
n8n is an open‑source workflow automation platform with a visual canvas and custom JavaScript/Python support. It connects to 500+ integrations, enables AI agents and RAG, offers audit logs, real‑time alerts, and can be self‑hosted on Docker or Kubernetes.
Free
Columns.ai is an AI tool for visual data storytelling. It uses ChatGPT to generate responses to data-related prompts and offers customization options for interactive visualizations.
Freemium
Browse AI enables code‑free web scraping and automation via a point‑and‑click interface. It captures dynamic, paginated, login‑protected data, auto‑detects site changes, exports to CSV/JSON/AWS S3, and streams into Google Sheets, Airtable, Zapier, APIs, and more.
Freemium
- $48.75/mo
AskCSV lets users upload CSV/TSV files in the browser, query data locally, and receive automatic charts, tables, and insights—such as top products or ROI—while preserving privacy and requiring headers for accurate processing.
Freemium
Nebius AI Studio offers efficient model deployment with hosted open-source models, ultra-low latency, and scalable processing options. It simplifies AI model exploration through an intuitive interface while ensuring verified quality and performance for diverse applications.
Free trial
Data Services by Clickworker provides a crowdsourced platform for data collection, validation, labeling, and categorization, assigning microtasks to a global workforce. It delivers scalable, ISO 27001‑compliant results and transparent workflow tracking for AI training and market research.
Freemium
- $13
Spark Beta by Mixpanel is an AI tool that uses natural language processing to provide insights on product, marketing, and revenue questions. It offers efficient report generation and CEO insights, while simplifying data management for better decision-making.
Subscription
- $20/mo
Supadash lets users connect to SQL databases like PostgreSQL or Supabase and automatically turns SELECT results into time‑series and bar charts without manual coding. It supports unlimited charts per project, offers AI‑generated layouts, and stores no user data.
Subscription
- $7/mo
Databar.ai is a data enrichment platform that connects to 100+ data providers and AI services. It imports company/lead lists, adds 450+ enrichment fields via drag‑and‑drop, syncs with major CRMs, and offers real‑time intent signals for targeted outbound campaigns.
Subscription
- $99/mo
Xdash AI offers seamless data analysis, in-depth reporting, and task automation. It excels at uncovering crucial insights from intricate datasets, facilitating informed business decisions.
Freemium