Matthew Rocklin

Results 1038 comments of Matthew Rocklin

I don't think we need to impose this on contributors. Instead, when merging, we can select the "Squash and Merge" option for the green button. This will do the squashing...

I think that this is possible On Wed, Jan 17, 2018 at 11:24 AM, Christopher J. Wright < [email protected]> wrote: > It seems that the tests are not included in...

You might want to look at [Stream.from_textfile](http://streamz.readthedocs.io/en/latest/api.html#streamz.from_textfile)

Probably something like the following: ```python source = Stream.from_textfile(...) example = pd.DataFrame(...) # provide an empty example to tell streamz about column names and dtypes df = source.map(parse).timed_window(0.5).to_dataframe(example=example) # do...

> or an accumulated dataframe that steadily grows with time and includes all data so far? I'm not sure how you would achieve the latter. I think that this is...

> Probably a better example would be to buffer lines of text until some threshold Yes I agree. Things like this already exist within streamz. You might consider `partition` or...

Are the examples provided above insufficient to get you started @apiszcz ?

That would be ok. If you have some data already then you might also read a little bit of your file and pass that dataframe instead (the example doesn't have...

Thanks for the error report. I've improved the error message in https://github.com/mrocklin/streamz/pull/106

df = source.map(parse).timed_window(0.5).to_dataframe(example=example) to_dataframe will also expect to be given a stream of dataframes. So assuming that the output of parse is something like a python dict, we might do...