mongo-arrow
MongoDB integrations for Apache Arrow. Export MongoDB documents to numpy arrays, Parquet files, and pandas DataFrames in one line of code.
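The snippets below illustrate those one-line exports. This is a minimal sketch, assuming a local MongoDB instance and an existing `test_db.data` collection (names are made up here); no explicit schema is passed, so types are inferred from the data.

```py
# Sketch of the one-line exports described above; collection names are assumed.
import pyarrow.parquet as pq
from pymongo import MongoClient
from pymongoarrow.api import find_arrow_all, find_numpy_all, find_pandas_all

coll = MongoClient()["test_db"]["data"]

df = find_pandas_all(coll, {})         # pandas DataFrame
arrays = find_numpy_all(coll, {})      # dict of numpy arrays keyed by field name
table = find_arrow_all(coll, {})       # pyarrow Table
pq.write_table(table, "data.parquet")  # Parquet file written via pyarrow
```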
### Function parameters example

```py
def write(collection, tabular, *, exclude_none: bool = False):
    ...
```

### Usage example

```py
write(collection, df, exclude_none=True)
```

### How

Replacing https://github.com/mongodb-labs/mongo-arrow/blob/main/bindings/python/pymongoarrow/api.py#L390 with ```py if...
Goal: Trying to read a MongoDB document with an embedded object containing an empty array into a PyArrow table, then write it out as a Parquet file. Expected result: Parquet...
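A rough reproduction sketch for that scenario follows. The collection and field names are assumed, not taken from the report, and it assumes the schema accepts nested pyarrow struct/list types.

```py
# Hypothetical reproduction: embedded object with an empty array -> Arrow -> Parquet.
import pyarrow
import pyarrow.parquet as pq
from pymongo import MongoClient
from pymongoarrow.api import Schema, find_arrow_all

coll = MongoClient()["test_db"]["docs"]
coll.insert_one({"meta": {"tags": []}})  # embedded object holding an empty array

schema = Schema(
    {"meta": pyarrow.struct([("tags", pyarrow.list_(pyarrow.string()))])}
)
table = find_arrow_all(coll, {}, schema=schema)
pq.write_table(table, "docs.parquet")
```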
I'm reproducing a bug in Airflow using the docker-compose method to run Airflow 2.8.1 with Python 3.11 (https://airflow.apache.org/docs/apache-airflow/2.8.1/howto/docker-compose/index.html#fetching-docker-compose-yaml). I'm creating a requirements.txt with the following packages: ``` pymongo==4.6.1...
Hi, when I use pymongoarrow.api.aggregate_arrow_all() it seems to return Decimal128 as FixedSizeBinary when [context.finish()](https://github.com/mongodb-labs/mongo-arrow/blob/main/bindings/python/pymongoarrow/context.py#L114) is called. Looking at the code, my assumption is that it stems from [lib.pyx](https://github.com/mongodb-labs/mongo-arrow/blob/main/bindings/python/pymongoarrow/lib.pyx#L763) where `return...
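A small sketch to observe the reported type mapping; the collection name and pipeline are assumptions. It inserts a `bson.Decimal128` value and prints the Arrow schema that `aggregate_arrow_all` produces, so the type chosen for the field can be inspected directly.

```py
# Hypothetical check: how does aggregate_arrow_all surface a Decimal128 field?
from bson import Decimal128
from pymongo import MongoClient
from pymongoarrow.api import aggregate_arrow_all

coll = MongoClient()["test_db"]["prices"]
coll.insert_one({"amount": Decimal128("19.99")})

table = aggregate_arrow_all(coll, [{"$match": {}}])
print(table.schema)  # shows the Arrow type used for the "amount" column
```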
... or does zero copy apply only between `arrow->pandas` and not here, `mongodb->arrow`? In other words, are Arrow data types used in MongoDB?
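For the `arrow->pandas` side of the question, here is a small illustration: once data is already in Arrow memory, pyarrow can hand primitive columns to numpy/pandas without copying in simple cases, whereas getting from MongoDB's BSON into Arrow still involves deserialization. This is only a sketch of the Arrow-side behavior, not a statement about pymongoarrow internals.

```py
# Zero copy is an Arrow-memory concept: primitive arrays without nulls can be
# exposed to numpy without copying; conversion to a DataFrame may still copy.
import pyarrow

arr = pyarrow.array([1.0, 2.0, 3.0])
np_view = arr.to_numpy(zero_copy_only=True)  # raises if a copy would be needed

table = pyarrow.table({"x": arr})
df = table.to_pandas()  # may or may not copy, depending on types and nulls
```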
I have a mongo document which has a list field containing child documents. Pandas data frames [can be nested](https://pandas.pydata.org/docs/user_guide/dsintro.html#dataframe). And PyArrow has `Table` and `RecordBatch` types. I would like to...
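One possible way to express a list-of-subdocuments field on the Arrow side is sketched below, assuming the schema accepts pyarrow list/struct types; the collection and field names are made up for illustration.

```py
# Hypothetical mapping of a list field containing child documents.
import pyarrow
from pymongo import MongoClient
from pymongoarrow.api import Schema, find_arrow_all

coll = MongoClient()["test_db"]["orders"]

schema = Schema({
    "customer": pyarrow.string(),
    "items": pyarrow.list_(
        pyarrow.struct([("sku", pyarrow.string()), ("qty", pyarrow.int64())])
    ),
})
table = find_arrow_all(coll, {}, schema=schema)
print(table.schema)
```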
aggregate_arrow_all(...) is more than four times slower in version 1.0.2 compared to 1.0.1 with nested object fields
Hi, thanks again for fixing the bugs in version 1.0.2. Unfortunately, it seems that the new version loads data approximately four times slower, or more, when there are nested fields in...
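A simple timing sketch like the one below, run once under pymongoarrow 1.0.1 and once under 1.0.2, could quantify the reported difference; the collection and pipeline here are hypothetical.

```py
# Hypothetical benchmark: time aggregate_arrow_all under each library version.
import time
from pymongo import MongoClient
from pymongoarrow.api import aggregate_arrow_all

coll = MongoClient()["test_db"]["events"]
pipeline = [{"$match": {}}]

start = time.perf_counter()
aggregate_arrow_all(coll, pipeline)
print(f"elapsed: {time.perf_counter() - start:.2f}s")
```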
Hi, I'm facing this issue when I try to turn my MongoDB collection into a pandas DataFrame using the find_pandas_all() function: `authors_pyarrow = Schema({"_id": ObjectId, "first_name": pyarrow.string(), "last_name": pyarrow.string(), "date_of_birth": datetime})` df...
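A cleaned-up sketch of how that call is usually spelled out in full is shown below; the database and collection names are assumptions, since the excerpt does not include them or the actual error.

```py
# Hypothetical end-to-end version of the call described in the excerpt.
from datetime import datetime

import pyarrow
from bson import ObjectId
from pymongo import MongoClient
from pymongoarrow.api import Schema, find_pandas_all

coll = MongoClient()["library"]["authors"]

authors_schema = Schema({
    "_id": ObjectId,
    "first_name": pyarrow.string(),
    "last_name": pyarrow.string(),
    "date_of_birth": datetime,
})
df = find_pandas_all(coll, {}, schema=authors_schema)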
I was trying mongo-arrow to load a dataset from MongoDB. It loads only the selected columns, which saves space, but the resulting DataFrame contains only NaT and None values....
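One way to narrow this down, assuming the cause is a mismatch between the declared schema and the stored BSON types, is to load a small sample without an explicit schema and compare the inferred dtypes against the declared ones; the collection and column names below are placeholders.

```py
# Hypothetical sanity check: let pymongoarrow infer types and inspect them.
from pymongo import MongoClient
from pymongoarrow.api import find_pandas_all

coll = MongoClient()["test_db"]["dataset"]

sample = find_pandas_all(coll, {}, projection={"col_a": 1, "col_b": 1})
print(sample.dtypes)
print(sample.head())
```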
Hi, when using pymongoarrow.api.aggregate_arrow_all() it seems to omit columns that would contain only null values.

#### Field "email" with None only

```python
data = [
    {"name": "Charlie", "email": None},
    {"name": ...
```
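A possible workaround sketch for the example above is to declare the column in an explicit schema so it keeps a concrete type even when every value is null; the pipeline and the choice of string type are assumptions based on the excerpt.

```py
# Hypothetical workaround: pass an explicit schema so the all-null column survives.
import pyarrow
from pymongo import MongoClient
from pymongoarrow.api import Schema, aggregate_arrow_all

coll = MongoClient()["test_db"]["people"]
coll.insert_many([
    {"name": "Charlie", "email": None},
    {"name": "Dana", "email": None},
])

schema = Schema({"name": pyarrow.string(), "email": pyarrow.string()})
table = aggregate_arrow_all(coll, [{"$match": {}}], schema=schema)
print(table.column_names)  # "email" is retained even though it is all null
```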