`scan_data()` is very slow on large datasets

Open hadley opened this issue 1 year ago • 2 comments

Specifically the interactions and correlations sections:

pointblank::scan_data(ggplot2::diamonds, sections = "IC")

Interactions takes ~30s, which isn't too surprising, but correlations takes a very long time, which does surprise me.

hadley, Aug 01 '24 16:08

Agreed, we could definitely make this faster/better. (Part of the entire scan_data() overhaul.)

rich-iannone, Aug 02 '24 15:08

Suggestion: model the improvements to `scan_data()` on the Python pointblank's `DataScan`, so that it can produce an LLM-ingestible summary.

kmasiello, Feb 21 '25 15:02