pandas-streaming
Streaming API for pandas applied to big datasets
It targets sources such as `io` streams (with a quantization criterion), `Kafka`, and other remote queues, in a way similar to how Spark Streaming works.
Pip doesn't install all required packages. It seems that scikit-learn and ijson are missing. The steps to demonstrate that scikit-learn is missing:

```bash
python -m venv venv
source venv/bin/activate
pip...
```
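As a complement to the shell steps in the report, the following is a minimal sketch of how one might check from Python which modules are not importable in the active environment. The helper name `missing_modules` is hypothetical and is not part of pandas-streaming; `sklearn` is the import name of scikit-learn.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names that cannot be found for import.

    Hypothetical helper for illustration; it only looks modules up and
    never imports them.
    """
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # The two packages the report says are missing after installation.
    print(missing_modules(["sklearn", "ijson"]))
```

Run inside the fresh virtual environment, an empty list would mean both modules are available; any name in the list was not installed.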
Reply to issue #41.
`train_test_apart_stratify()` produces different results for the same input data, even when setting `random_state=0`. To reproduce this, I've adapted the example from the function's docstring to contain only strings (i.e., the...
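The excerpt does not state the cause. One thing that can make string-only inputs behave differently between interpreter runs is Python's string hash randomization, which changes the iteration order of sets of strings. The sketch below is an assumption-laden way to test for that kind of dependence: it runs the same snippet in separate interpreters under different `PYTHONHASHSEED` values and compares the printed output. The helper names `run_with_hash_seed` and `outputs_match` are hypothetical.

```python
import os
import subprocess
import sys


def run_with_hash_seed(code, seed):
    """Run `code` in a fresh interpreter with a fixed PYTHONHASHSEED."""
    env = dict(os.environ, PYTHONHASHSEED=str(seed))
    result = subprocess.run(
        [sys.executable, "-c", code],
        env=env,
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


def outputs_match(code, seeds=(0, 1, 2)):
    """Return True if `code` prints the same text under every hash seed."""
    outputs = {run_with_hash_seed(code, seed) for seed in seeds}
    return len(outputs) == 1


if __name__ == "__main__":
    # Sorting removes any dependence on set iteration order.
    print(outputs_match("print(sorted({'a', 'b', 'c'}))"))
```

To apply this to the report, the `code` string would hold the reproduction script. If the outputs differ across seeds while `random_state` stays fixed, hash ordering is a plausible suspect. If they match, the cause lies elsewhere.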