Build, run, and manage data pipelines for integrating data
Python ETL framework for stream processing, real-time analytics, LLM
A Python package for interactive geospatial analysis and visualization
The power of Chart.js with Python
A cross-platform installer for the Julia programming language
Monitor the stability of a Pandas or Spark dataframe
Train machine learning models within Docker containers
An open source multi-tool for exploring and publishing data
Docker image used to run data processing workloads
Pythonic tool for running machine-learning/high-performance workflows
Making DAG construction easier
Metadata and data identification tool and Python library
High-Performance Symbolic Regression in Python and Julia
Data science on data without acquiring a copy
An orchestration platform for the development, production
Benchmarking synthetic data generation methods
Detecting silent model failure. NannyML estimates performance
Main repository for Vispy
Uncover insights, surface problems, monitor, and fine-tune your LLM
CKAN is an open-source DMS for powering data hubs
Library providing end-to-end GPU-accelerated recommender systems
Python module that helps you build complex pipelines of batch jobs
Scale your Pandas workflows by changing a single line of code
Production-ready data processing made easy and shareable
Best practices on recommendation systems