Matthew Powers

Results 218 issues of Matthew Powers

I want to make CodeQuizzes beautiful - help me!

enhancement

Can someone please help me add specs to this site!?

enhancement

http://stackoverflow.com/questions/4136248/how-to-generate-a-human-readable-time-range-using-ruby-on-rails This would be an interesting challenge. It can be solved iteratively or recursively.

It should be relatively straightforward to build a Dask Delta writer that outputs the JSON metadata that powers Delta. Building the whole Delta Engine writer is hard, but outputting the...

I played around with [delta-rs](https://github.com/delta-io/delta-rs) and think that'd be an even easier way to implement the Delta read functionality. Here's how we can get all the files that need to...

Thanks for making and maintaining this great project for so long. I've been using Janus for years, ever since reading [Yehuda's blog](https://yehudakatz.com/2010/07/29/everyone-who-tried-to-convince-me-to-use-vim-was-wrong/). Is this project going to be maintained anymore?...

Dask can perform groupby operations relatively well for columns with low cardinality, but performance seems to degrade significantly for columns with more distinct values. Let's look at the h2o groupby...

dataframe
discussion
parquet

The `to_csv` method outputs filenames with a `.part` extension by default. This post argues that `to_csv` should output CSV files with a `.csv` extension by default. Let's create a DataFrame...

dataframe
good second issue
enhancement

Pandas 1.3 includes an experimental `string[pyarrow]` dtype that's much more memory efficient than `object` columns. Rocklin [made a video on this new dtype](https://www.youtube.com/watch?v=_zoPmQ6J1aE&t=145s&ab_channel=Coiled) which shows how the `string[pyarrow]` type greatly...

discussion
needs attention