pynsee

fastparquet : dependency seems to be missing

Open hadrilec opened this issue 1 year ago • 2 comments

Hello, while running this piece of code:

from pynsee.sirene import search_sirene

# Get a list of hospitals in Paris
df = search_sirene(variable = ["activitePrincipaleUniteLegale",
                               "codePostalEtablissement"],
                   pattern = ["86.10Z", "75*"], kind = "siret")

I got this error:

Unable to find a usable engine; tried using: 'pyarrow', 'fastparquet'.
A suitable version of pyarrow or fastparquet is required for parquet support.
Trying to import the above resulted in these errors:
 - Pandas requires version '10.0.1' or newer of 'pyarrow' (version '8.0.0' currently installed).
 - Missing optional dependency 'fastparquet'. fastparquet is required for parquet support. Use pip or conda to install fastparquet.

After installing fastparquet, the error disappeared. @tfardet shall we add this package to the dependencies?

hadrilec avatar Aug 24 '24 08:08 hadrilec

This is strange, since the pyarrow dependency is in requirements.txt. My hypothesis is that you are using pynsee directly from a local path (since you need to be able to develop and test in real time) and you have not updated your installed requirements since I added parquet?

There is no need to have both fastparquet and pyarrow, so since pyarrow is already listed as a dependency, we should be fine.
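
For anyone hitting the same error, here is a minimal sketch to check which parquet engines are installed in the current environment and whether pyarrow is recent enough. The helper names `parse_version` and `engine_status` are made up for this example and are not part of pynsee or pandas. The `10.0.1` minimum is taken from the error message above and may differ for other pandas versions.

```python
from importlib.metadata import PackageNotFoundError, version


def parse_version(text):
    """Turn '10.0.1' into (10, 0, 1); non-numeric suffixes are ignored."""
    parts = []
    for piece in text.split(".")[:3]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        parts.append(int(digits or 0))
    return tuple(parts)


def engine_status(package, minimum=None):
    """Return 'missing', 'too old' or 'ok' for an installed package."""
    try:
        installed = version(package)
    except PackageNotFoundError:
        return "missing"
    if minimum is not None and parse_version(installed) < parse_version(minimum):
        return "too old"
    return "ok"


# 10.0.1 is the minimum quoted in the error message above (an assumption
# for any other pandas version); fastparquet is checked for presence only.
for name, minimum in [("pyarrow", "10.0.1"), ("fastparquet", None)]:
    print(name, engine_status(name, minimum))
```

If pyarrow comes back as "too old", upgrading it in the environment should be enough, and fastparquet would not be needed.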

tfardet avatar Aug 25 '24 01:08 tfardet

@hadrilec can we close this or do you still think there is an issue?

tfardet avatar Sep 17 '24 07:09 tfardet