mongo-arrow

MongoDB integrations for Apache Arrow. Export MongoDB documents to NumPy arrays, Parquet files, and pandas DataFrames in one line of code.

Results 26 mongo-arrow issues

Hi, _arrow_to_polars currently does not support casting extension types in nested fields. This prevents ObjectIDs from being read when they are in nested fields. I could not manage...

Hi, this might look like a stupid bug report at first glance, but let me explain: Assume a master service is reading data from MongoDB where the data is written...

linked-to-jira

Hi, I really appreciate the support for polars, but

```
collection.insert_one({'obj': {'data_to_test': bson.ObjectId()}})
pymongoarrow.api.aggregate_polars_all(collection, [], schema=pymongoarrow.api.Schema({'obj': {'data_to_test': bson.ObjectId}}))
```

raises with `polars.exceptions.ComputeError: cannot create series from Extension("pymongoarrow.objectid", FixedSizeBinary(12), Some(""))` as...

Current pin in version 1.4.0 is "pyarrow>=16.0,

dependencies

I have a use-case where I want to extract data as an arrow table, save as parquet, and then later load it with polars. My problem is that I cannot...

```
import pymongo
import pymongoarrow
print(pymongo.__version__, pymongoarrow.__version__)
from pymongoarrow.monkey import patch_all
patch_all()
```

works with:

```
4.8.0 1.5.1
```

fails with:

```
4.9.1 1.5.1
```

Exception

```
---------------------------------------------------------------------------
ModuleNotFoundError Traceback...
```

Having updated pip to the latest version and installed libbson with yum, I am still getting the following error. Any pointers would be great. The exit code: 1 The error output...

https://github.com/mongodb-labs/mongo-arrow/blob/fffe5bb8634fda82309429e96fdc65a0c33ce9cd/bindings/python/pyproject.toml#L40 "packaging >=23.2,

bug

**pymongoarrow replaces existing codecs** The api.write() method replaces any existing codecs. The collection TypeRegistry is replaced with a new instance, effectively removing any existing custom codecs. https://github.com/mongodb-labs/mongo-arrow/blob/1.6.0/bindings/python/pymongoarrow/api.py#L419 A related issue...
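The difference between replacing a registry and merging into it can be illustrated with a toy model. This is only a sketch: `ToyRegistry`, `replace_registry`, `merge_registry`, and the codec names are hypothetical stand-ins, not the `bson` or pymongoarrow API, and the merge shown is just one possible way to keep existing codecs.

```python
# Toy model only: ToyRegistry and the codec names below are hypothetical
# stand-ins, not the bson.codec_options.TypeRegistry API.
from dataclasses import dataclass


@dataclass(frozen=True)
class ToyRegistry:
    """Immutable collection of codec names, standing in for a type registry."""
    codecs: tuple = ()


def replace_registry(existing: ToyRegistry, extra: list) -> ToyRegistry:
    """Behaviour described in the report: the existing codecs are dropped."""
    return ToyRegistry(codecs=tuple(extra))


def merge_registry(existing: ToyRegistry, extra: list) -> ToyRegistry:
    """One possible alternative: keep existing codecs and append the new ones."""
    kept = [c for c in existing.codecs if c not in extra]
    return ToyRegistry(codecs=tuple(kept) + tuple(extra))


# A user-configured registry with one custom codec (hypothetical name).
user_registry = ToyRegistry(codecs=("DecimalCodec",))

replaced = replace_registry(user_registry, ["ArrowBinaryCodec"])
merged = merge_registry(user_registry, ["ArrowBinaryCodec"])

print(replaced.codecs)  # the custom codec is gone
print(merged.codecs)    # the custom codec survives alongside the new one
```

In the toy model, only the merged registry still contains the user's custom codec after the write-side codecs are added.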

Hi, I hope you're all having a wonderful day. I have a Redshift table with 4 columns, two of which are string versions of ObjectId. I load the...