Matthew Powers
Matthew Powers
I want to make CodeQuizzes beautiful - help me!
http://stackoverflow.com/questions/4136248/how-to-generate-a-human-readable-time-range-using-ruby-on-rails This would be an interesting challenge. It can be solved iteratively or recursively.
It should be relatively straightforward to build a Dask Delta writer that outputs the JSON metadata that powers Delta. Building the whole Delta Engine writer is hard, but outputting the...
I played around with [delta-rs](https://github.com/delta-io/delta-rs) and think that'd be an even easier way to implement the Delta read functionality. Here's how we can get all the files that need to...
Thanks for making and maintaining this great project for so long. I've been using Janus for years, ever since reading [Yehuda's blog](https://yehudakatz.com/2010/07/29/everyone-who-tried-to-convince-me-to-use-vim-was-wrong/). Is this project going to be maintained anymore?...
Dask can perform groupby operations relatively well for columns with low cardinality, but performance seems to degrade significantly for columns with more distinct values. Let's look at the h2o groupby...
The `to_csv` method outputs filenames with a `.part` extension by default. This post argues that `to_csv` should output CSV files with a `.csv` extension by default. Let's create a DataFrame...
Pandas 1.3 includes an experimental `string[pyarrow]` dtype that's much more memory efficient than `object` columns. Rocklin [made a video on this new dtype](https://www.youtube.com/watch?v=_zoPmQ6J1aE&t=145s&ab_channel=Coiled) which shows how the `string[pyarrow]` type greatly...