Anatoly Myachev

Results 135 issues of Anatoly Myachev

Modin 0.27.0 ```python import modin.pandas as pd import numpy as np df = pd.DataFrame(np.random.rand(100, 100)) df2 = df.iloc[:0] pd.merge(df, df2)

bug 🦗
P1

## What do these changes do? This is an example of how DataFrame API protocol implementation might look like if it will be added directly to Modin, instead of https://github.com/data-apis/dataframe-api-compat....

## What do these changes do? These changes are waiting for https://github.com/data-apis/dataframe-api-compat/pull/71. - [x] first commit message and PR title follow format outlined [here](https://modin.readthedocs.io/en/latest/development/contributing.html#commit-message-formatting) > **_NOTE:_** If you edit the...

This module is public and is used quite often. It shouldn't be difficult to maintain, as it has a few functions: ```python __all__ = [ "assert_extension_array_equal", "assert_frame_equal", "assert_series_equal", "assert_index_equal", ]...

new feature/request 💬
pandas concordance 🐼
P2

For reference: https://github.com/pandas-dev/pandas/pull/54383

new feature/request 💬
P1

## What do these changes do? - [x] first commit message and PR title follow format outlined [here](https://modin.readthedocs.io/en/latest/development/contributing.html#commit-message-formatting) > **_NOTE:_** If you edit the PR title to match this format,...

`_propagate_index_objs` changes the index of all partitions, even if the new index is the same as the old one. We can try to avoid this and only update where the...

Performance 🚀
P2
Internals

This implies a decrease `num_returns` parameter. Results from example: ```bash 2023-07-15 00:19:36,263 INFO worker.py:1544 -- Started a local Ray instance. View the dashboard at 127.0.0.1:8265 time second option call: 2.3153023719787598...

Performance 🚀
question ❓
Ray ⚡
P2

For consistency with other functions like `read_csv_glob` and `read_parquet_glob`.

Code Quality 💯
pandas.io
P2

As an example, one can use the more complex implementation of `read_csv_glob` function, which can create several partitions from 1 file if necessary. However, we need to keep in mind...

Performance 🚀
pandas.io
P2