h2o4gpu
h2o4gpu copied to clipboard
Benchmark tSVD on DAI
Lets see if we can use tSVD internally!
- Choose a set of datasets, they should be diverse (different types of columns, different sizes) but preferably something users would use
- Benchmark current DAI impl on those datasets
- Switch DAI code to use our tSVD and rerun the benchmarks
Lets see what we get.
Please also have a look at memory footprint - if we notice that for certain datasets we're running OOM we will need a switch in our wrapper to use scikit instead.
Initial benchmarks on tsvd impl in h2o4gpu:
