lens issues

Plotly requirements update

1

PR raised in response to #44 > Prior to version 4, this library could operate in either an "online" or "offline" mode. The documentation tended to emphasize the online mode,...

mm5631

Replace deprecated `DataFrame.get_values` with `to_numpy`

Fixes #47

zblz

pandas==1.0.1 and lens.summarize(df) throws Error

1

The DataFrame method `get_values` doesn't exist any more. I downgraded pandas to 0.25.0 to make it work. The current [setup.py](https://github.com/facultyai/lens/blob/master/setup.py) requires pandas but doesn't specify a version.

rmminusrslash
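For readers hitting the same error, the replacement named in the PR title above can be sketched as follows. This is a minimal illustration, not the actual patch to lens: the helper name `frame_values` and the sample frame are hypothetical. It assumes only that `to_numpy` is available from pandas 0.24 onwards and that `get_values` was removed in pandas 1.0.

```python
import pandas as pd

# Hypothetical sample data, only for illustration.
df = pd.DataFrame({"a": [1, 2, 3], "b": [4.0, 5.0, 6.0]})


def frame_values(frame):
    """Return the underlying array without calling the removed get_values()."""
    # to_numpy() is available from pandas 0.24 onwards.
    if hasattr(frame, "to_numpy"):
        return frame.to_numpy()
    # Fallback for older pandas releases that lack to_numpy().
    return frame.values


values = frame_values(df)
print(values.shape)
```

Downgrading pandas, as the reporter did, also works, but a helper along these lines avoids depending on an old pandas release.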

can't install lens despite all solutions i googled

it gives me the error > error: Microsoft Visual C++ 14.0 is required. Get it with > "Build Tools for Visual Studio": https://visualstudio.microsoft.com/downloads/ > ---------------------------------------- > ERROR: Command errored out...

erjan

Compatibility issue with new version of plotly?

3

Hi guys, getting the following error with the explorer module: ``` AttributeError Traceback (most recent call last) in ----> 1 explorer.correlation_plot() ~/machine_learning/.env/lib/python3.7/site-packages/lens/explorer.py in correlation_plot(self, include, exclude) 311 """ 312 fig...

mm5631

WIP: Allow dask dataframes as input to lens.summarise

This is work in progress: DO NOT MERGE. This PR adapts the summarise functionality to be able to take a [dask dataframe](https://dask.pydata.org/en/latest/dataframe.html), which will allow it to take in larger-than-memory datasets...

zblz

Use matplotlib instead of plotly in Explorer

Plotly has the advantage of producing interactive plots in a Jupyter notebook, but it does not result in easily portable plots. We should consider ways of making the...

zblz

enhancement

good first issue

Consider removal of t-digest computation

Right now the [t-digest](https://github.com/tdunning/t-digest) computation (done using a [python t-digest implementation](https://github.com/CamDavidsonPilon/tdigest)) takes most of the time in generating a summary. The initial motivation to include it was for it to...

zblz

enhancement

discussion

good first issue

Support dask distributed scheduler

The dask distributed scheduler is generally an improvement over the multiprocessing scheduler even in individual multicore machines because of its improved awareness of data locality, so we should consider adding...

zblz

feature

good first issue

Consider partial and resumable computation of summaries

For large datasets where computing the summary may be expensive, it would be useful to compute only part of it, be able to explore it, and then compute other parts...

zblz

feature

discussion

good first issue
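One way the partial and resumable idea above could look is sketched here. Everything in it is an assumption made for illustration: the function `summarise_resumable`, the JSON checkpoint file, and the per-column statistics are hypothetical and are not part of the lens API. The sketch computes only the requested columns, saves progress after each one, and skips anything already stored on a later call.

```python
import json
import os
import statistics
import tempfile


def summarise_resumable(columns, checkpoint_path, only=None):
    """Compute per-column summaries, reusing results saved in a checkpoint.

    columns: dict mapping column name to a list of numbers (hypothetical input).
    only: optional set of column names to compute in this pass.
    """
    # Load partial results from an earlier pass, if any.
    if os.path.exists(checkpoint_path):
        with open(checkpoint_path) as f:
            summary = json.load(f)
    else:
        summary = {}

    for name, values in columns.items():
        if only is not None and name not in only:
            continue  # not requested in this pass
        if name in summary:
            continue  # already computed earlier
        summary[name] = {
            "count": len(values),
            "mean": statistics.fmean(values),
            "min": min(values),
            "max": max(values),
        }
        # Persist after every column so an interruption loses little work.
        with open(checkpoint_path, "w") as f:
            json.dump(summary, f)
    return summary


# Usage: summarise one column first, explore it, then resume with the rest.
checkpoint = os.path.join(tempfile.mkdtemp(), "summary.json")
data = {"a": [1, 2, 3], "b": [10.0, 20.0]}
first_pass = summarise_resumable(data, checkpoint, only={"a"})
second_pass = summarise_resumable(data, checkpoint)
```

The second call only computes column `b`, because `a` is read back from the checkpoint. A real implementation would also need to detect a changed dataset and invalidate stale entries.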
