Browse free open source Data Science tools and projects for Linux below. Use the toggles on the left to filter open source Data Science tools by OS, license, language, programming language, and project status.
An implementation of the Grammar of Graphics in R
RStudio is an integrated development environment (IDE) for R
Scalable and Flexible Gradient Boosting
Positron, a next-generation data science IDE
Data science spreadsheet with Python & SQL
Vector database for scalable similarity search and AI applications
Automatic extraction of relevant features from time series
A framework for real-life data science
Data science on data without acquiring a copy
Train machine learning models within Docker containers
The Go kernel for Jupyter notebooks and nteract
Project structure for doing and sharing data science work
Parallel computing with task scheduling
Graphical User Interface Toolkit for Python with minimal dependencies
Always know what to expect from your data
High-Performance Serverless event and data processing platform
Detecting silent model failure. NannyML estimates performance
A reactive notebook for Python
Build data pipelines, the easy way
GPU DataFrame Library
The data science OS
For building machine learning (ML) workflows and pipelines on AWS
Solutions and Notes for Labs of Computer Systems
Course materials for the Data Science Specialization on Coursera
Library providing end-to-end GPU-accelerated recommender systems