Anatoly Myachev

Results 197 comments of Anatoly Myachev

> Looking at this, isnt The Right Way to handle this to copy pandas' _MergeOperation code and adapt it? think so, im just not familiar with this code. will look

@AndreyPavlenko thanks for contribution! Please also add a test and a release note into `docs\release_notes\release_notes-0.16.0.rst`.

After fixing problem above it's needed to fix another one initially found in https://github.com/modin-project/modin/pull/4283#discussion_r919048273- (buffer with non-zero starting position). Reproducer: ```python import modin.pandas as pd import pandas df = pd.DataFrame({"col1":...

Let me also test the performance for large dataframes before merging. UPD added results: Shape: rows=150.0M x ncols=10 | script part | pandas | modin master 16 cores | this...

> LGTM Happy to merge but there are the following issues in our CI: `ValueError: buffer source array is read-only`. I don't have fix for this for now.

> > CI: ValueError: buffer source array is read-only > > does this happen on pandas main too? I don’t know, I don’t have the right environment for this at...

Hi @c3-cjazra. Thanks for posting! The bug is fixed on 0.14.0. It is possible for you update Modin version?

Blocked by https://github.com/modin-project/modin/issues/3572

@c3-cjazra the main problem here is that we haven't fully switched to using `fsspec`, so reading multiple files can only be for s3, because we are explicitly using `s3fs`. This...

Some problems when trying to enable abfs-like URLs: https://github.com/fsspec/adlfs/issues/319.