andreassot10 comments

Results 10 comments of


                                            andreassot10

mlr3pipelines::PipeOpTextVectorizer is very slow

Thanks @pfistfl. I think that I could help with this- or at least I could try. I'm doing this stuff for work, which means that I cannot allocate all of...

mlr3pipelines::PipeOpTextVectorizer is very slow

Sounds reasonable. I'll take a look this week.

mlr3pipelines::PipeOpTextVectorizer is very slow

@pfistfl , I need a little bit of support on this please, as I'm pretty new to the R6 stuff. So I'm trying to understand what the backends are and...

mlr3pipelines::PipeOpTextVectorizer is very slow

Thanks so much for the detailed response and ideas, @pfistfl. I'm almost convinced that the only way forward would be to break completely free from `data.table`/`data.frame`/`Matrix` formats and create a...

mlr3pipelines::PipeOpTextVectorizer is very slow

So a `dfm` is, according to its authors: _"[...] a type of Matrix-class object with additional slots, described below [in `dfm-class {quanteda}`]. quanteda uses two subclasses of the dfm class,...

mlr3pipelines::PipeOpTextVectorizer is very slow

Thanks, things are much clearer now. As it turns out, it's the conversion from `dfm` to `matrix` with `quanteda::convert` that slows things down in `PipeOpTextVectorizer`: https://github.com/mlr-org/mlr3pipelines/blob/6427f5e9377d7c3d7e1e1aac063c410cffb351b9/R/PipeOpTextVectorizer.R#L239 Converting the `matrix` to...

mlr3pipelines::PipeOpTextVectorizer is very slow

Apologies for the long silence. I'm working on [solution (2)](https://github.com/andreassot10/mlr3extralearners), i.e. build a `mlr3extralearners` version of `quanteda`'s Multinomial NB model that directly incorporates `mlr3pipelines::PipeOpTextVectorizer` in it, to avoid the unnecessary...

andreassot10

mlr3pipelines::PipeOpTextVectorizer is very slow

mlr3pipelines::PipeOpTextVectorizer is very slow

mlr3pipelines::PipeOpTextVectorizer is very slow

mlr3pipelines::PipeOpTextVectorizer is very slow

mlr3pipelines::PipeOpTextVectorizer is very slow

mlr3pipelines::PipeOpTextVectorizer is very slow

mlr3pipelines::PipeOpTextVectorizer is very slow

Tuning SMOTE's K with a trafo fails: 'warning("k should be less than sample size!")'

Tuning SMOTE's K with a trafo fails: 'warning("k should be less than sample size!")'

Tuning SMOTE's K with a trafo fails: 'warning("k should be less than sample size!")'