gpubootcamp icon indicating copy to clipboard operation
gpubootcamp copied to clipboard

[HPC_AI][AI for Science Climate] Text pre-processing notebook throws pandas error

Open pzharrington opened this issue 3 years ago • 2 comments

In file Pre-Processing_Text_data.ipynb, there is a pandas call that is out of date for pandas>=1.4, but the Dockerfile does not specify a specific pandas version during installation. The line in question is:

pd.set_option("max_columns", None)

Should be updated to

pd.set_option("display.max_columns", None)

Otherwise, pandas<1.4 should be specified in the Dockerfile. See https://github.com/pandas-dev/pandas/issues/45619

pzharrington avatar Aug 10 '22 18:08 pzharrington

Valid observation. Will be updated

bharatk-parallel avatar Sep 07 '22 10:09 bharatk-parallel

Thank you @pzharrington for bringing it up.

I've created a pull request #112 wherein I had set the pandas version explicitly to 1.3.5 and had built and tested the container for reliability.

Requesting @bharatk-parallel to review and merge the request.

aswinkumar1999 avatar Sep 20 '22 04:09 aswinkumar1999

Fixed in #112

Closing issue.

aswinkumar1999 avatar Oct 13 '22 15:10 aswinkumar1999