Kenneth Benoit comments

Results 308 comments of


                                            Kenneth Benoit

Restructure chapter outline for including transformers

Add a new section on language models, that include word2vec or GloVe vectors, including ways to get them from spaCy models. Some packages that include transformers/huggingface library access now: -...

textmodel_ca documentation

Can you show your output please?

textmodel_ca documentation

You probably mean ```r > library("quanteda") > dfmat tmod summary(tmod) Length Class Mode sv 7 -none- numeric nd 1 -none- numeric rownames 14 -none- character rowmass 14 -none- numeric rowdist...

Integrate textmodels into quanteda's pkgdown site

See https://github.com/r-lib/pkgdown/issues/1251

Add more language-specific pages

Could add a link to this page, a German-language tutorial set up by Cornelius Puschmann: http://inhaltsanalyse-mit-r.de/0_einleitung.html

Add short section on how to import pre-tokenized text

Since the udpipe output is almost the same structure as that from `spacyr::spacy_parse()`, it can use the method for `as.tokens.spacyr_parsed()`. If you want the POS tag appended, you have to...

Add short section on how to import pre-tokenized text

I thought of that too, then we would just need the equivalents of `docid_field = "doc_id", tokenid_field = "token", pos_field = "pos"` etc. The **udpipe** output almost matches the `spacyr_parsed`...

How to left join docvars with those in an existing corpus

Thanks. More generally (and basically): ```r str(corpus("this is my single document")) ## Error in `[[.corpus`(object, 1L) : ## cannot index docvars this way because none exist ```

How to left join docvars with those in an existing corpus

But keep in mind this, from `?corpus`: > #### A warning on accessing corpus elements > > A corpus currently consists of an S3 specially classed list of elements, but...

How to left join docvars with those in an existing corpus

@conjugateprior Refresh with the latest GitHub version and try it now.