elenasamuylova
elenasamuylova
Hi @LifeBoey, you might find this blog useful https://www.evidentlyai.com/blog/data-drift-detection-large-datasets There, we generate artificial drift and then explore how each statistical test reacts to it. There is also a notebook with...
Hi @echarso, @lowballedintern, we are now starting to work on the beta for Spark integration. I was wondering if any of you'd be open to chatting about how you want...
Hi @that-rahul-guy! This is indeed an old example, with each is not currently maintained. We made several major changes to the library since - specifically, completely rewrote how metrics are...
Hi @SamRodkey, Right now, we are working on an alternative visualization option for large datasets: it will still use Plotly, but generate aggregated plots without retaining the data inside them....
Hi @gakuba, @yanhong-zhao-ef, @MattiaGallegati, @dvirginz, @SamRodkey, we just released the lightweight Evidently reports with aggregated visuals: https://github.com/evidentlyai/evidently/releases/tag/v0.3.2 By default, plots are now aggregated which makes the results HTML smaller. If...
Hi @prity-k, Spark support is currently in development. If you want to test it pre-release, here are the instructions (currently works from several data drift metrics): https://github.com/evidentlyai/evidently/pull/806
Hi @sean-hickey-wf, there is also a remote workspace example here https://github.com/evidentlyai/evidently/tree/main/examples/service
Hi @sean-hickey-wf, Right now the JSON `snapshots` must be stored at the same machine where the Evidently Monitoring service runs. They must be stored in a directory accessible by UI...
Hi @sean-hickey-wf Thanks a lot for sharing - this is very helpful for us to understand what to document better and which examples to add in the future. > Push...
Thanks for sharing @userkkw - it would indeed make sense to pass the `drift_share` parameter to the `DataDriftTable`. We will address this in one of the following releases.