pydata topic

List pydata repositories

pandas-datareader

2.8k
Stars
673
Forks
Watchers

Extract data from a wide range of Internet sources into a pandas DataFrame.

dask

12.1k
Stars
1.7k
Forks
Watchers

Parallel computing with task scheduling

cudf

7.5k
Stars
846
Forks
Watchers

cuDF - GPU DataFrame Library

koalas

3.3k
Stars
354
Forks
Watchers

Koalas: pandas API on Apache Spark

stumpy

3.0k
Stars
285
Forks
Watchers

STUMPY is a powerful and scalable Python library for modern time series analysis

pyjanitor

1.3k
Stars
167
Forks
Watchers

Clean APIs for data cleaning. Python implementation of R package Janitor

sgkit

213
Stars
33
Forks
Watchers

Scalable genetics toolkit

pydata-sphinx-theme

541
Stars
293
Forks
Watchers

A clean, three-column Sphinx theme with Bootstrap for the PyData community

array-api

203
Stars
41
Forks
Watchers

RFC document, tooling and other content related to the array API standard

distributed

1.5k
Stars
710
Forks
Watchers

A distributed task scheduler for Dask