Rehan Sohail Durrani

Results 30 comments of Rehan Sohail Durrani

I agree with @vnlitvinov - that way we don't have to have a complicated set of rules for whose commits are formatted in a specific way for which people, but...

Hi folks! The `sort_by` is nearly complete - we're just tracking down the last few CI bugs, and hope to get this in by end of week! One open question...

I did some rough benchmarking, and noticed that the new sort is sensitive to skew in the data (each of the tests except the one for the sort algo below...

These experiments were run on an m5.2xlarge with 512 GiB of memory. The following script was used: ```python from time import perf_counter import ray import numpy as np from modin.config...

One quick update is I noticed that the partition skews were actually due to two things - how we sample, as well as the fact that I was setting the...

On a 5_000_000 million row x 100 col df, we get the following times: | pandas | Modin Master | Modin This Branch | | ------- | -------------- | ------------------...

@vnlitvinov has this issue been resolved? I believe we've synchronized a lot of our requirements, but are we still hoping to have one source of truth for requirements instead of...

Hi @lemerchand! `sort_values` currently defaults to pandas; however, we are working on a fix for this #4601 - after which you shouldn't receive that warning anymore! In general, Modin defaults...

Ah I see - I think this is because the `sort_values` is currently implemented by gathering the full columns we are sorting by on the head node, sorting them, and...