AutoProfiler icon indicating copy to clipboard operation
AutoProfiler copied to clipboard

Support for other DataFrame libraries

Open lucas-nelson-uiuc opened this issue 1 year ago • 2 comments

Wondering if there's been discussion for supporting other dataframe (or pandas-compatible) libraries? Happy to talk through or help develop support to extend the ecosystem.

lucas-nelson-uiuc avatar May 19 '24 19:05 lucas-nelson-uiuc

Hey thanks for starting discussion on this, I think would be really cool. A couple of libraries that come to mind, curious which ones you or others are using the most that would be the best to support

  • Polars
  • Ibis
  • Modin -- this might work with minimal overhead since has exact same API as Pandas as my understanding
  • Raw numpy matricies (if 2d)

willeppy avatar May 25 '24 12:05 willeppy

Generally, in terms of implementation I think would involve...

  • Python pandas executor to support different libraries and detect different dataframe types in the front end code that is executed (https://github.com/cmudig/AutoProfiler/blob/main/src/dataAPI/jupyter/PythonExecutor.ts)
  • And then implementing the required functions in python like we have for pandas (https://github.com/cmudig/AutoProfiler/blob/main/digautoprofiler/profile_lib.py)

willeppy avatar May 25 '24 13:05 willeppy